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OVERVIEW 


This  project  initially  focused  on  the  development  of  synchronization  algorithms  and 
software  for  a  network  of  embedded  MICA  (or  Berkeley)  motes.  Automata  and  Petri  net 
models  of  operational  aspects  were  derived  and  verification  techniques  were  used  to 
guarantee  correctness.  These  results  were  reported  in  the  PI  presentations,  copies  of 
which  accompany  this  report  (Appendix  D).  A  major  part  of  this  project  was  dedicated  to 
the  Red  Force  Tagging  project.  Since  the  Red  Force  Tagging  (or  TagMote)  project 
incorporated  the  earlier  network  middleware  developed  in  this  project,  in  this  final  report 
we  have  primarily  focused  on  describing  in  full  detail  the  TagMote  project. 

Specifically,  in  the  first  part  of  this  report,  there  is  a  detailed  description  of  the  developed 
TagMote  hardware,  software  and  algorithms.  In  the  second  part,  fundamental 
contributions  to  the  communications  in  wireless  sensor  networks  are  described  with  most 
of  the  technical  details  included  in  the  publications,  copies  of  which  are  included  in  the 
Appendices.  The  networking  results  were  developed  for  the  most  part  during  the  second 
half  of  this  project  to  address  issues  brought  forth,  because  of  the  increasing  number  of 
sensor  nodes  in  the  applications  of  interest.  Note  that  in  this  project  additional  new 
results  on  overload  management  for  distributed  control  were  developed  to  address 
problems  arising  in  the  fairing  application  of  Boeing  (see  Appendix  C). 

In  summary,  the  first  part  of  this  report  describes  the  Red  Force  Tagging  project,  while 
the  second  part  describes  the  Networking  research  results.  The  Appendices  contain 
further  information  and  copies  of  publications.  Electronic  copies  of  the  PI  presentations 
also  accompany  this  report. 
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Section  1.  Executive  Summary 


In  this  section  the  goals  and  accomplishments  of  the  Red  Force  Tagging  project 
(TagMote  project)  are  briefly  described.  High  level  descriptions  of  the  TagMote 
operation,  the  flow  of  information,  the  hardware,  middleware  and  algorithms  are  given. 
Appendix  D  contains  copies  of  the  PI  meeting  presentations  where  the  TagMote 
hardware,  software  and  operation  are  described  in  detail. 

The  goal  of  this  TagMote  project  was  the  reconstruction  of  the  space/time  path  a  subject 
has  taken  from  an  initial  to  a  final  location.  The  reconstruction  is  based  on  Dead 
Reckoning  (DR)  and  it  works  in  areas  where  GPS  is  not  available,  for  example  in  urban 
canyons  and  inside  buildings.  Such  information  is  also  useful  to  civilians,  as  it  can  be 
used  for  personal  navigation  by  campers  and  hikers,  firemen  and  police.  It  addition,  such 
capability  can  be  used  for  mapping  an  area,  specifically  to  create  maps  of  areas  where 
GPS  is  not  available,  by  just  having  someone  walk  through  these  areas.  The 
reconstruction  algorithms  only  use  data  collected  by  a  special  MEMS  device  the  subject 
has  been  carrying.  When  the  subject  arrives  in  the  proximity  of  a  deployed  sensor 
network,  the  data  collected  on  the  portable  device  are  extracted  automatically  via  wireless 
communication  and  deposited  in  the  nodes  of  the  sensor  network.  These  data  that  are 
distributed  within  the  network  are  then  sent  to  a  collection  point  within  the  network  for 
further  processing  and  display. 

The  specific  objective  of  the  TagMote  project  was  to  demonstrate  the  feasibility  of 
building  such  mapping  device  (hardware,  software)  that  cooperates  with  the  NEST  sensor 
network  and  utilizes  middleware  to  accomplish  the  stated  goals.  This  objective  was 
accomplished. 

Our  approach  to  pedestrian  navigation  relies  on  a  specially  built  MEMS  based  device 
(called  the  TagMote)  that  includes  sensors,  memory,  a  processor  and  RF  communication 
capabilities.  This  device  uses  accelerometers  and  a  digital  magnetic  compass  to  collect 
data  regarding  distance  traveled  and  direction,  and  it  uses  memory  to  store  data  points. 
The  device  incorporates  Berkeley  motes  (Mica2)  to  provide  processing  power  and 
communications.  Special  software  implements  algorithms  that  provide  compensation  for 
adverse  sensor  orientation  and  calibration,  and  for  collection  with  possible  coding  of  data 
and  storage  on  a  nonvolatile  memory.  The  data  collection  and  part  of  the  processing 
takes  place  while  the  device  carried  by  the  subject  is  out  of  range  of  the  network.  When 
the  subject  enters  again  into  radio  range  of  the  network,  a  communication  link  is 
established  automatically  and  the  data  are  transmitted  initially  to  one  or  more  of  the 
network  motes  and  then  via  multi-hop  transmissions  to  a  designated  special  “relay”  or 
“base  node”  mote  for  further  processing.  Algorithms  detect  occurrence  of  steps  and 
calculate  distance  traveled,  and  use  direction  of  movement  and  distance  traveled  to 
generate  trajectory  points,  which  are  then  appropriately  displayed. 

There  were  several  successful  demonstrations  of  the  project  each  involving  the  following 
steps: 
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1.  Setup:  The  sensor  network  grid  was  physically  deployed,  initialized  and 
synchronized,  so  it  could  receive  data  from  a  mobile  mote  (TagMote)  that  joins 
the  network,  and  also  could  route  data,  that  have  been  downloaded  at  any  mote,  to 
the  designated  special  relay  mote  for  further  processing  and  display. 

2.  Initialization:  During  initialization,  the  special  device  (TagMote)  acquires  its 
position  and  time  to  be  used  as  reference.  For  the  field  experiment  this  was  hand 
programmed  into  the  device. 

3.  Operation:  A  subject  carrying  the  special  TagMote  walked  out  of  range  of  the 
network  for  some  time  interval,  walked  in  a  normal  fashion  tracing  some  path  and 
returned  within  range  of  the  network  at  some  later  time. 

4.  Data  transmission  to  the  network:  The  data  from  the  TagMote  were  downloaded 
wirelessly  to  network  motes. 

5.  Data  routed  to  the  relay  mote  within  the  network. 

6.  Data  processing,  display  and  evaluation:  There  is  initial  data  processing  on  the 
TagMote  before  they  are  being  transmitted  to  the  network.  In  addition,  data 
transmitted  to  the  network  and  sent  up  to  the  relay  mote  are  processed  and 
displayed  to  show  the  trajectory  traced  by  the  subject. 

Hardware:  We  used  a  specially  designed  and  built  MEMS  device  (TagMote)  to  mount 
sensors  so  to  collect  data,  process  and  store  these  data  and  then  transmit  to  the  network. 
We  have  designed  and  built  boards  that  contain  a  4MB  flash  memory,  the  sensors  and 
additional  processors.  This  TagMote  board  attaches  to  a  mote  as  a  daughter  board, 
similarly  to  Mica’s  sensor  boards.  We  used  an  integrated  3D-accelerometer  and  3D- 
magnetic  compass  device  that  incorporates  amplification,  signal  conditioning, 
temperature  compensation  etc  made  by  Leica  Geosystems.  In  this  prototype  version,  the 
TagMote  and  Mica  boards  were  packaged  together  and  were  attached  to  the  belt  of  a 
subject;  the  battery  that  supports  the  TagMote  and  Mica  are  packaged  separately  and  are 
placed  in  a  pocket  of  the  subject. 

We  are  reading  six  sets  of  sensor  data  (3  sets  from  the  accelerometer  and  3  sets  from  the 
magnetic  compass)  using  a  Mica  processor  and  we  are  storing  them  in  the  4MB  flash 
memory  of  the  TagMote  board.  We  read  the  data  from  the  flash  memory  and  we  send 
them  to  the  Mica  communication  stack  for  transmission  to  the  network  in  response  to  an 
acknowledgement  of  the  TagMote ’s  Ping  signal  by  the  network.  Just  before  sending  the 
data  to  the  network  we  are  performing  some  preprocessing  to  identify  the  points  where 
min  and  max  occur  in  the  vertical  acceleration  and  we  then  only  transmit  data 
corresponding  to  these  trajectory  points.  Later  versions,  described  in  detail  in  the 
hardware  section,  involve  two  processors  and  more  sophisticated  processing. 

Software:  Special  software  was  developed  to  implement  algorithms  for  processing  the 
sensor  data  on  the  mote  and  before  displaying.  Algorithms  must  detect  occurrence  of 
steps  and  calculate  distance  based  on  some  model  for  the  average  stride  and  compensate 
for  differences. 

In  general,  the  larger  the  amount  of  processing  performed  onboard  the  TagMote,  the 
smaller  the  amount  of  time  required  for  downloading  the  data  necessary  to  display  certain 
distance  traveled  by  the  subject. 
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The  amount  of  processing  performed  onboard  the  TagMote  is  of  course  determined  by 
the  processing  power  available.  In  our  initial  demos  (August  2003),  we  were  limited  by 
the  single  Mica  processor.  In  a  later  TagMote  version,  additional  processing  power  was 
provided  by  an  additional  Mica  processor  and  the  processing  capabilities  were 
significantly  enhanced. 

After  physical  deployment  of  the  sensor  grid,  the  network  must  autonomously 
reconfigure  itself  to  form  a  multi-hop  communication  network  in  which  all  nodes  have  a 
"global"  sync  time.  The  following  middleware  services  are  used  in  the  network: 
Neighborhood  initialization  so  to  build  tables  of  neighborhood  ID’s;  Network  clock 
synchronization;  Localization  (needed  to  determine  TagMote’ s  initial  position)-for  the 
demos  the  initial  position  and  time  was  introduced  manually  into  the  TagMote.  The 
TagMote  uses  a  Ping  Service,  which  is  used  by  the  TagMote  to  find  the  network;  a  Data 
Dump  Service  is  used  by  TagMote  to  download  (dump)  data  to  the  network.  The 
services  need  to  inform  the  TagMote  if  a  transmitted  data  packet  has  been  handed  off  to 
the  network  successfully  and  this  is  done  via  an  acknowledgement.  At  the  completion  of 
the  download  all  the  TagMote  data  are  stored  in  one  or  more  than  one  motes  of  the 
network.  Having  more  than  one  of  the  network  motes  being  involved  with  the  TagMote 
data  download  is  desirable,  since  a  mote  may  only  have  limited  memory  available  and 
also  the  subject  may  be  moving  from  being  close  to  one  mote  to  being  closer  to  a 
different  mote.  Such  scheme  provides  enhanced  flexibility  and  robustness.  The  data 
stored  in  the  network  motes  are  then  transmitted  to  a  relay  or  base  mote  typically  through 
multi-hop  transmissions.  The  middleware  services  need  to  provide  reliable,  high 
throughput  channels  for  streaming  TagMote  data  to  the  destination.  These  were  provided 
by  A  Broker  Service  that  builds  a  spanning  tree  back  to  the  relay,  and  a  Publication 
Service  that  routes  dumped  TagMote  data  to  the  relay  mote. 


Figure  1 . 1  The  display  of  a  tmth  trajectory  against  a  satellite  photo 
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Section  2.  Dead  Reckoning  Algorithm:  Errors  and  Solutions 

This  section  describes  the  algorithms  used  to  process  accelerometer  and  compass  data 
and  estimate  position  using  Dead  Reckoning.  The  algorithms  estimate  the  length  of  step, 
direction  and  step  count. 

A  Dead  Reckoning  (DR)  algorithm  estimates  the  total  distance  from  a  known  starting 
point  by  estimating  the  size  of  each  step  and  detecting  the  heading. 

p(i  + 1  j=  P(f)+  d (iy  a  (Vj 

where  P(i):  Position  after  i  steps;  d(i):  Step  length  of  zth  step;  a  (i):  Magnetic  azimuth  of 
zth  step. 

2.1.  Estimation  of  the  Step  Length 

The  3 -axis  accelerometer  on  the  waist  belt  is  used  to  detect  acceleration  of  body 
movement.  From  the  filtered  signal,  we  can  approximate  the  step  length  by  [1]  (reference 
numbers  refer  to  the  list  at  the  end  of  the  present  section): 

d  =  K  ■  $[a  -A  . 

V  max  min 

where:  A  .  is  the  minimum  acceleration  measured  in  the  Z  axis  in  a  single  stride;  A  is 

the  maximum  acceleration  measured  in  the  Z  axis  in  a  single  stride;  K  is  a  constant  (that 
also  performs  unit  conversion  i.e.  feet  or  meters  traveled). 

This  technique  has  been  shown  to  measure  distance  walked  accurately  to  within  ±3%  for 
the  same  subject  and  ±8%  across  a  number  of  subjects  with  different  leg  lengths  (See 
Tables  2.1,  2.2  and  2.3  for  experiments  conducted  at  Notre  Dame).  Close  coupling  of  the 
accelerometer  to  the  body  is  important  to  maintain  accuracy. 


Errors  and  Solutions: 

•  We  look  at  the  leg  as  being  a  lever  of  fixed  length  while  the  foot  is  on  the  ground. 
The  experimental  results  demonstrate  that  a  constant  can  be  used  for  K  without  a 
large  accuracy  penalty.  GPS  or  distributed  sensor  networks  with  the  localization 
capability  can  be  utilized  to  obtain  more  accurate  step  length  estimation. 

•  The  same  K  can  be  used  for  normal,  slow  and  fast  walking  on  different  surfaces. 
A  shorter  distance  will  be  obtained  for  fast  walking  and  a  larger  variance  for  soft 
surface  such  as  grassland.  The  errors  are  introduced  by  the  violation  of  our 
assumption  that  the  leg  is  fixed  length.  An  adaptive  algorithm  that  “learns”  the 
users  stride  characteristics  could  improve  the  accuracy. 


Table  2.1:  The  const 
straig 

ant  factor  K  for  three  different  subjects  (75m 
it  walking  on  cement  sidewalk) 

Subject 

A 

B 

C 

Leg  Length 

0.94 

1.06 

1.08 

K  (Nominal  Value) 

0.50 

0.55 

0.57 
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Table  2.2.  Using  the  same  K  to  estimate  the  distance  with  different 
walking  patterns  (75m  straight  walking  on  cement  sidewalk) 

Subject 

Normal 

Slow 

Fast 

A(K=  0.5) 

75.3 

74.2 

73.5 

B  (K=  0.55) 

75.5 

74.6 

73.3 

C  (£  =  0.57) 

74.9 

76.0 

72.8 

Table  2.3.  The  estimated  walking  distances  and  errors  (East/West  Component)  when  the 
same  subject  walked  normally  towards  South/North  for  100  meters  at  a  Olympic  stadium 

(with  K  =0.5) 

Case  Number 

Estimated 
Distance  (m) 

Error  1  (m) 

(no  calibration) 

Error  2  (m) 
(with  tilt 
calibration) 

Error  3  (m) 

(with  total 
calibration) 

i 

102.8 

20.2 

5.2 

2.5 

2 

101.6 

20.5 

5.5 

2.9 

3 

102.4 

20.8 

5.3 

2.7 

4 

101.6 

8.4 

1.4 

1.2 

5 

102.4 

9.3 

1.8 

0.8 

6 

103.0 

11.1 

2.0 

0.7 

7 

102.5 

9.9 

5.2 

2.3 

8 

102.2 

9.3 

1.7 

0.9 

9 

102.3 

10.4 

1.0 

1.6 

10 

102.1 

9.0 

0.7 

1.9 

11 

100.1 

2.7 

12.2 

0.5 

12 

103.0 

3.2 

9.7 

0.9 

13 

102.3 

3.3 

9.5 

0.9 

14 

100.7 

4.8 

10.3 

2.5 

15 

102.6 

5.3 

11.2 

3.0 

16 

103.0 

5.6 

11.8 

3.3 

17 

101.6 

4.4 

10.5 

2.0 

18 

102.4 

5.6 

11.8 

3.3 

19 

102.3 

4.8 

11.1 

2.4 

20 

102.2 

5.3 

10.8 

3.0 

Mean 

102.2 

6.8° 

2.1° 

1.1° 

STD 

0.7 

2.1° 

1.0° 

0.5° 

2.2.  Compass  Navigation 

The  magnetic  azimuth  is  the  horizontal  component  of  the  Earth  magnetic  field.  Its 
determination  requires  implicitly  the  knowledge  of  the  horizontal  or  vertical  plane.  This 
is  commonly  done  by  sensing  the  gravity  vector  at  rest.  To  compute  then  the  azimuth  of 
walk,  one  has  to  constantly  compute  the  attitude  of  the  sensor  in  order  to  correct  the 
measured  magnetic  values.  Using  the  3D  rotation  matrix  with  the  Yaw(^)-Pitch(<^)- 
Roll(  0)  sequence,  the  horizontal  components  Hy  and  Hx  are 
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S  =  Inclination  Angle 

Figure  2.1.  Compass  tilt  referenced  to  the  Earth’s  Figure  2.2.  Earth’s  field  in  X,  Y,  Z  coordinates 
horizontal  plane 

H  =b  cos (p  +  b  sin<z>sin#  +  b  siiKpcos# 

H  =  b  cos 6-b  sin<9 

y  y  z 

where  bx,  by,  and  bz  are  components  measured  by  the  3-D  magnetic  compass  (Figure  2.1). 

The  azimuth  derived  from  these  values  will  contain  and  propagate  the  errors  present  in 
the  attitude  angles  themselves.  According  to  the  first  order  Taylor  development  of  the 
azimuth  computation,  the  error  produced  can  be  written  as  [2] 

A  a  =  -A#-  tan  <7  •  cos«  -  iXcp  •  tan  A  •  sina 

where  8  is  the  angle  of  the  magnetic  field  to  the  surface  of  the  earth,  also  called  the  dip, 
or  inclination,  angle  (See  Figure  2.2).  In  the  northern  hemisphere,  the  inclination  angle  is 
roughly  70°  down  toward  the  North. 

This  relation  shows  that  the  error  in  determining  the  attitude  angles  affects  directly  the 
azimuth  and  its  effect  strongly  depends  on  the  azimuth  itself.  For  mid-latitude,  the 
average  value  of  2  for  tan  8  can  be  considered. 

Errors  and  Solutions: 

Heading  error  is  affected  by: 

•  Sensor  platform  accuracy  (±0.5°):  A/D  converter  resolution,  magnetic  sensor 
errors,  temperature  effects 

•  Magnetic  disturbances:  Sources  of  magnetic  fields  include  permanent  magnets, 
motors,  electric  currents,  and  magnetic  metals  such  as  steel  or  iron.  For  example, 
a  long  ferroconcrete  building  can  cause  a  four-degree  heading  error  (Figure  2.3). 

•  Compass  tilt  errors:  A  one-degree  error  in  tilt  can  cause  nearly  three  degrees  of 
heading  error.  (The  inclination  angle  of  the  earth’s  field  near  our  office  at 
University  of  Notre  Dame,  Indiana,  is  approximately  70°  [3].) 

•  Variation  of  the  earth’s  field:  The  angular  difference  between  the  magnetic  and 
true  north  expressed  as  an  Easterly  or  Westerly  variation.  This  difference  is 
defined  as  the  variation  angle  and  is  dependent  on  the  compass  location  - 
sometimes  as  large  as  25°. 
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Heading  error  caused  by  a  long  ferroconcrete  building 


1  DeBart 
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Figure  2.3.  Walking  close  to  a  ferroconcrete  building  may  introduce  a 
heading  error. 

Dynamic  (tilt)  and  static  calibrations  are  needed  to  compensate  for  compass  errors: 

•  Dynamic  calibration:  Dynamic  calibration  will  be  necessary  to  compensate  the 
individual  errors  caused  by  walking.  Low-pass  filtering  the  additional 
accelerations  eliminates  the  typical  oscillations  in  walking.  If  the  computed  pitch 
and  roll  values  reflect  the  movements  done  by  the  hips,  virtual  values  have  to  be 
defined  to  compensate  the  displacement  effects  on  the  horizontal  plane.  These 
additional  constant  corrections  reflect  the  individual  characteristics  of  a  walk  and 
the  symmetry  between  left  and  right  strides. 

•  Static  calibration:  When  a  two-axis  (X,  Y)  magnetic  sensor  is  rotated  in  the 
horizontal  plane  with  no  disturbances,  the  output  plot  of  Hx  vs.  Hy  will  form  a 
circle  centered  at  the  origin  (0,  0).  The  effect  of  a  magnetic  disturbance  on  the 
heading  will  be  to  distort  the  circle,  resulting  in  an  ellipsoid  not  centered  at  the 
origin.  For  example,  even  the  human  body  can  offset  the  circle  as  shown  in  Figure 
2.4. 


Effect  of  magnetic  disturbances  (360-degree  standing) 


Figure  2.4.  The  effect  of  body  offset 
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Two  scaling  factors  Xsf  and  Fs/  can  be  determined  to  change  the  ellipsoid  response  to  a 
circle.  Offset  values  X0ff  and  F^/  can  then  be  calculated  to  center  the  circle  close  to  the 
origin  [4],  Applying  the  corrections  to  the  projected  magnetic  values,  we  have 

+  X 


H  =X-H 

x  sf  X 


Off 


H  =Y-H  +7tf 

y  sj  y  _mes  ojj 


The  effect  of  total  calibration  (dynamic  and  static  calibration)  has  been  shown  in  Table 
2.3.  A  heading  error  of  1.1°  is  obtained  after  total  calibration,  compared  to  a  heading 
error  of  6.8°. 


(b) 

Figure  2.5.  Effects  of  the  different  calibration  phases,  (a)  A  cross-type  walk 
on  DeBartolo  Quad  at  ND;  (b)  A  walk  on  a  400m  track  Olympic  stadium 
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2.3.  Summary 

The  achievable  precision  for  the  dead  reckoning  algorithm  is: 

•  Distance  estimation:  ±%3  with  different  walking  speed  on  different  surfaces 

•  Azimuth  estimation:  ±1°  in  open  area;  ±5°  in  urban  area  (without  strong 
magnetic  disturbances) 

The  performance  of  the  algorithm  can  be  further  improved  if  GPS  or  gyro  information  is 
available. 
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Section  3.  Tagmote  Hardware 

This  section  describes  both  electrical  and  mechanical  designs  of  the  Tagmote  mobile 
sensor.  Technical  drawings  and  electrical  schematics  are  provided  for  the  reader’s 
reference.  Note  that  here  the  latest,  2  processor  version  of  the  TagMote  is  described. 
Details  of  earlier  versions  may  be  found  in  the  PI  presentations  that  accompany  this 
report. 

3.1.  Electronic  Design 

The  latest  Tagmote  design  consists  of  four  main  components  the  MICA,  the  slave 
processor  board,  the  interface  board,  and  the  Leica  sensors.  Figure  3.1  displays  a  block 
diagram  of  the  Tagmote.  The  slave  processor  is  in  charge  of  executing  most  of  the  data 
processing  while  a  Berkeley  MICA  platform  handles  the  communication  and  networking 
activities.  The  Leica  Geosystem’s  DMC-SX  provides  a  digital  magnetic  compass  and  a 
three-axis  accelerometer,  which  is  connected  to  the  Leica  through  a  specialized  board  that 
contains  a  4MB  FLASH  memory  for  data  storage. 

The  TagMote  uses  a  Berkeley  MICA  platform  [1]  (reference  numbers  refer  to  the  list  at 
the  end  of  the  present  section)  for  general  coordination  and  communication.  Its  central 
processor  is  an  Atmel  ATMEGA128L  8-bit  processor  running  at  7.3728  MHz.  This 
microcontroller  has  128  KB  of  programmable  flash  memory  and  512KB  of  external 
FLASH  memory.  The  MICA  also  provides  radio  communication  at  433  MHz  (Note  that 
an  earlier  version  of  MICA  used  916  MHz  for  radio  communication;  the  current  version 
with  433  MHz  is  designated  as  MICA2).  The  radio  has  8  channels  and  is  capable  of 
transmitting  data  at  a  rate  of  38.4  Kbaud  with  a  range  of  1000ft  (the  radio  transmission 
range  is  severely  affected  by  ground  absorption  effects).  A  5 1-pin  connector  is  used  to 
interface  with  the  MICA  platform. 


Figure  3.1  Tagmote  block  diagram. 
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Figure  3.2  Schematic  of  Tagmote  interface  board. 

The  Leica  sensor  was  selected  because  of  its  sensor  capability,  small  size,  and  the  ability 
for  data  filtering.  The  Leica  has  a  3 -axis  magnetometer  and  3 -axis  accelerometer  with 
data  rates  up  to  50  Hz  [2],  The  size  of  the  Leica  is  33  mm  x  3 1  mm  x  13.5  mm.  It  has 
flexible  software  with  13  operating  modes,  which  provide  different  levels  of 
compensation.  Communication  with  the  Leica  is  done  by  means  of  one  serial  interface, 
which  is  protected  against  non-valid  commands. 

The  interface  board  connects  the  Leica  to  the  slave  processor  board  and  provides  4  MB  of 
additional  Flash  memory  for  data  storage.  A  schematic  of  the  Tagmote  interface  board  is 
given  in  Figure  3.2.  The  Leica  sensor  connects  to  JP1,  which  supplies  power  (5V), 
ground,  serial  data  input  and  output. 

The  slave  processor  is  in  charge  of  configuring  the  sensor,  retrieving  measurements  from 
it,  pre-processing  the  information,  storing  them  into  the  flash  memory,  and  handing  them 
out  to  the  MICA  processor  to  be  transmitted  over  the  network.  The  slave  processor  board 
is  augmented  with  its  own  ATMEL128L  processor  that  draws  its  power  from  the  5 1  pin 
connector  to  the  MICA  processor.  The  slave  processor  board  interfaces  to  the  interface 
board  using  a  5 1  pin  connector  compatible  with  the  MICA  processor.  The 
communication  between  the  MICA  processor  and  the  slave  processor  is  done  using  a  task 
driven  custom  serial  protocol  that  uses  interrupts  as  acknowledgements. 
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The  input  and  output  signals  for  the  Leica  sensor  are  driven  by  the  TXD  and  RXD  pins  of 
the  slave  processor  serial  port.  The  memory  chip  is  the  AT45DB161B  Ul,  this  serial 
flash  data  chip  has  a  serial  interface  and  works  with  3  V,  which  makes  it  ideal  to  connect 
to  the  MICA  where  small  interface  pin  count  and  low  voltage  operation  is  required.  The 
reset  pin  was  left  unconnected  since  the  chip  provided  its  own  reset  circuit. 

Open  collector  inverters,  74HCT06,  provide  the  voltage  translation  to  connect  the  5V 
Leica  sensor  logic  to  the  3  V  slave  processor  logic.  These  inverters  are  labeled  U2B,  U2C, 
U2E,  U2F  in  Figure  3.2.  Inverters  are  used  instead  of  a  buffer  because  we  want  to 
connect  the  slave  processor  reset  pin  to  the  memory,  which  already  required  an  inverter. 
Open  collector  inverters  have  at  their  output  a  transistor  with  nothing  connected  at  its 
collector  end  and  grounded  at  its  emitter.  The  collector  end  is  available  at  the  output  pin. 
So  this  can  be  used  to  transform  the  logic  at  their  input  (which  in  this  case  is  3  V)  to  any 
higher  voltage  (5  V)  by  placing  a  resistor  connected  at  this  higher  voltage.  When  the 
output  is  low,  the  transistor  is  saturated  and  a  voltage  of  about  0. 15  Volts  can  be  obtained 
with  the  appropriate  resistor  (R3).  When  the  output  is  high,  the  transistor  is  off  and  the 
voltage  on  the  output  will  be  approximately  5  V  is  the  input  being  fed  has  much  higher 
impedance  to  ground  than  the  resistor  used.  On  the  other  hand  U2F  is  used  to  translate 
the  5  V  output  logic  of  the  Leica  sensor  to  the  3  V  input  logic  of  the  slave  processor.  Here 
D1  is  used  to  prevent  the  input  to  U2F  be  higher  that  3.5V  while  R4  is  used  to  limit  the 
current  that  goes  through  D1  when  the  output  of  the  Leica  sensor  is  5  V.  All  other 
resistors  are  needed  to  place  a  high  level  of  3V  to  the  inverters  outputs. 

All  capacitors  used  are  decoupling  capacitors  for  the  3  V,  5  V,  and  unregulated  power 
supply  for  the  linear  5  V  regulator  U4  ZSR500G.  The  unregulated  power  supply  is  fed  to 
the  circuit  though  JP2.  For  our  tests,  we  used  a  9V  alkaline  battery  to  provide  the 
unregulated  power.  Note  that,  relatively  large  decoupling  capacitors  are  needed  for  the 
5  V  power  regulator  (>20uF)  since  the  Leica  draws  large  intermittent  amounts  of  current 
during  its  power-up. 

Atmel  provides  several  memory  chips  with  the  same  footprint  and  same  interface,  which 
allows  future  memory  expansions  by  just  replacing  chip  Ul.  We  are  currently  using  an 
Atmel  DATAFLASH  AT45DB161B  3V  read/write  FLASH  memory  chip,  which 
provides  4MB  of  data  storage  [3].  The  interface  consists  of  SCLK  (serial  clock),  SO 
(serial  output),  SI  (serial  input),  and  CS  (chip  select).  All  pins  are  inputs  except  SO.  All 
transactions  are  started  by  selecting  the  chip  with  CS.  After  that  data  can  be  shifted  in 
through  SI  or  out  through  SO,  each  bit  is  clocked  using  SCLK  which  is  driven  by  the 
MICA.  The  protocol  that  manipulates  the  pins  appropriately  was  implemented  by 
software. 


3.2.  Mechanical  Design 

Several  different  housing  cases  were  developed  to  protect  the  TagMote’s  electrical 
hardware  and  sensors  from  environmental  damage.  Some  designs  housed  both  the 
batteries  and  Tagmote  together.  However,  the  current  design  separates  the  power  supply 
from  the  MICA  and  Leica  sensor.  This  was  done  for  a  couple  of  reasons.  First,  designs 
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that  held  the  circuitry,  sensors,  and  power  supply  were  much  larger  and  had  a  tendency  to 
cantilever  during  testing,  which  produced  significant  error  in  the  data.  Furthermore,  the 
close  proximity  of  the  batteries  to  the  Leica  sensor  tends  to  cause  a  large  disturbance  in 
the  magnetometer  readings. 

The  current  design  consists  of  an  epoxy  case,  which  houses  the  MICA,  interface  board, 
and  Leica  to  prevent  them  from  environmental  damage,  and  a  hand-held  case  for  the 
power  supply.  Technical  drawings  of  the  Tagmote  case  are  provided  at  the  end  of  this 
section.  Note  that  this  case  was  used  for  the  second  version  of  the  TagMote  and  photos 
may  be  found  in  the  PI  presentations  that  accompany  this  report.  Stereolithagraphy  was 
used  to  manufacture  the  housing  case  because  it  provided  the  flexibility  needed  for  the 
design  and  was  very  economical. 
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Tagmote  Hardware  Characteristics 


Processing  MICA2  board 

Processor: 

Program  Memory: 

Program  RAM: 

Radio: 

Operating  Voltage: 


Atmel  ATMEGA128L  8bit  RISC  @  7.3728MHZ 
128  Kbytes 
512  Kbytes 

38.4  Kbits/sec  @  433  MHz 
3  Volts 


Slave  Processor  board 

Processor: 

Program  Memory: 
Program  RAM: 

Radio: 

Operating  Voltage: 


Atmel  ATMEGA128L  8bit  RISC  @  7.3728MHZ 

128  Kbytes 

512  Kbytes 

none 

3  Volts 


Sensor  &  Log  Memory  Board 

Sensor:  Leica  3  axis  accelerometer  3  axis  magnetometer 


Interface: 

Sampling  Freq: 

Data  Frame  Size: 
Effective  Data  Size: 
Sensor  Voltage: 


UART  w.  voltage  translator  i 
30  Hz 

28  bytes/sample 
12  bytes/sample 
5  Volts 


9600  bits/sec 


Memory  Chip: 

Memory  Type: 

Memory  Capacity: 
Maximum  No  Samples: 
Memory  Voltage: 
Maximum  Operating  Freq: 
Interface: 


Atmel  Data  Flash  AT45DB161B 
Serial  Interface  Flash  Memory 
16  Mbits  (4MB) 
variable:  aprox  12  hours 
3  Volts 
20  MHz 

Interrupt-driven  RAM  circular  buffer  data  collection, 
Task-driven  write/read  software  serial  interface. 
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R0.88 


Technical  Drawings  of  Tagmote  Case 
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SCALE 
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SCALE  2,000 


Section  4.  Tagmote  Software,  NEST  Middleware  and  Base  Station 
Software 

Recall  the  setting  of  the  Red  Force  Tagging  project.  A  Tagmote  acquires  data,  and  then 
downloads  it  to  a  network,  when  the  Tagmote  is  in  the  proximity  of  the  network.  The 
network  consists  of  a  number  of  motes,  which  we  call  network  motes.  The  RelayMote  is 
a  special  network  mote  which  is  physically  connected  to  the  base  station.  The  data 
downloaded  to  the  network  is  sent  to  the  RelayMote,  which  forwards  it  to  the  base  station. 
The  base  station  processes  the  data  and  displays  it.  All  communication  is  wireless,  except 
for  the  communication  between  the  RelayMote  and  the  base  station. 

The  software  developed  in  this  project  can  be  divided  in  several  categories: 

•  Tagmote  specific  software 

•  NEST  Middleware  for 

o  The  network  motes 
o  The  RelayMote 
o  The  Tagmote 

•  Base  station  software  for 

o  Network  monitoring/debugging 
o  Data  decoding 
o  Data  processing  and  display 

This  section  contains  a  description  these  software  components,  except  for  the  display 
software,  which  is  detailed  in  the  next  section.  The  software  exists  in  several  working 
versions.  The  latest  version  of  the  NEST  middleware  allows  for  data  compression. 

A  brief  description  of  the  main  functions  of  the  software  is  as  follows. 

o  The  Tagmote  software  acquires  data  from  the  Tagmote  sensors,  processes  it,  and 
saves  it  in  the  memory.  The  processing  includes  a  form  of  data  compression, 
which  increases  the  amount  of  data  that  can  be  stored  in  the  memory  and 
decreases  the  download  time.  After  the  prescribed  amount  of  data  is  recorded  in 
the  memory,  the  Tagmote  begins  searching  for  a  network.  When  a  network  is 
found,  the  data  is  downloaded  to  the  network.  The  Tagmote  software  is 
distributed  on  two  processors,  containing  also  a  software  module  governing  the 
communication  between  the  processors. 

o  The  middleware  software  provides  networking  services,  including  a  global  time 
(synchronization),  routing  services,  and  data  logging.  Thus,  the  network  motes 
that  receive  data  from  the  Tagmote  can  save  it  by  means  of  the  data  logging 
service,  and  then  route  it  towards  the  relay  by  means  of  the  routing  services. 

o  The  base-station  software  receives  the  data  forwarded  by  the  relay,  and  decodes 
(decompresses)  it.  The  decompressed  data  is  then  sent  to  a  java  tool  that  processes 
it  and  displays  the  calculated  trajectory  graphically. 

4.1.  Tagmote  Software 

In  addition  to  some  of  the  NEST  middleware  components,  described  in  the  next 
subsection,  the  Tagmote  includes: 
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o  “hardware”  components  (that  is,  components  that  monitor  the  hardware) 
o  a  communication  component 

The  communication  component  allows  a  distributed  implementation  of  the  software  on 
two  processors.  One  processor  is  on  a  regular  mica2  board,  while  the  other  on  a  custom 
made  board.  A  master/slave  like  architecture  is  used  in  which  the  mica2  board  is  the 
master  and  the  custom  made  board  the  slave.  The  master  runs  the  middleware  software 
and  the  communication  component.  The  slave  runs  the  hardware  components  and  the 
communication  component. 

The  communication  between  processors  uses  one  line  for  serial  communication  and  two 
lines  for  interrupt  signals.  The  communication  is  asynchronous.  To  avoid  slowing  down 
more  important  tasks,  communication  is  implemented  by  a  low  priority  task.  Thus,  the 
data  rate  depends  on  how  busy  the  processors  are.  The  communication  uses  a  protocol 
that  ensures  data  is  transferred  in  a  reliable  manner;  only  hardware  failures  may  cause 
transmission  errors. 

The  “hardware”  components  also  implement  some  real-time  processing  of  the  sensor  data. 
The  role  of  this  processing  is  to  reduce  the  download  time  and  filter  the  sensor  data.  The 
download  time  is  reduced  in  two  ways: 

o  Some  of  the  processing  required  to  estimate  the  trajectory  from  the  raw  data  takes 
place  in  the  Tagmote,  in  real-time.  Thus,  the  Tagmote  does  a  min/max  detection, 
required  by  the  estimation  algorithm.  Since  only  the  min/max  values  are  sent,  8 
times  less  data  (on  the  average)  is  sent  to  the  base  station, 
o  Data  is  compressed,  by  using  differential  measurements.  A  differential 

measurement  consists  of  the  difference  between  the  current  data  and  the  previous 
data.  If  data  does  not  vary  very  fast,  fewer  bits  can  be  used  for  the  difference  than 
for  the  current  data  value.  On  the  average,  this  technique  speeds  up  the  download 
time  by  a  factor  of  2. 

The  Tagmote  “hardware”  components  are: 

•  HWTAGMOTE:  This  component  is  in  charge  of  orchestrating  all  the  hardware 
components  and  providing  an  interface  for  the  top-level  application.  Real-time 
processing  of  sensor  data  is  also  done  in  this  component.  Further,  the  component 
initializes  the  lower  level  components,  initiates  the  process  of  data  acquisition  and 
storage  of  the  data  received  from  the  Leica  sensor  to  the  Flash  memory,  stops  the 
acquisition  of  data  when  the  number  of  lines  read  reaches  a  certain  number,  and 
finally  it  issues  read  commands  to  the  flash  that  will  trigger  a  data  read  ready 
event  to  retrieve  the  data  from  the  Flash. 

•  FLASH:  This  component  manages  the  Serial  Flash  chip.  The  commands  that  can 
be  issued  to  this  component  are:  Start  Reading,  Start  and  Stop  Writing,  Write,  and 
Read.  The  actual  processes  that  involve  directly  the  Flash  are  managed  using 
tasks.  This  avoids  blocking  the  microcontroller  from  performing  other  interrupt 
driven  tasks.  The  Write  command  stores  the  data  in  a  circular  buffer  (100  bytes). 

A  task  called  WritingProcess  is  in  charge  of  transferring  the  data  in  the  circular 
buffer  to  the  Flash  memory.  The  circular  buffer  structure  allows  a  continuous 
generation  of  data  to  be  transferred  to  the  Flash  in  bursts.  This  is  because  the 
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Flash  data  is  written  first  to  an  internal  buffer;  after  the  buffer  is  full  a  command 
is  issued  to  transfer  the  contents  to  the  main  memory.  The  WritingProcess  task 
keeps  track  of  the  internal  buffer  status  and  initiates  the  transfer  to  main  memory. 
It  will  repost  itself  after  each  action,  until  no  more  command  to  the  Flash  are  to  be 
issued  and  the  temporal  buffer  is  empty.  The  Read  command  posts  a 
ReadingProcess  task.  Similarly  to  the  WritingProcess  the  data  is  read  through  a 
Flash  internal  buffer  and  not  directly  from  main  memory.  Therefore 
ReadingProcess  keeps  track  of  the  contents  of  the  internal  buffer  and  initiates  the 
transfer  from  main  memory  to  the  internal  buffer  when  necessary.  After  a  data 
byte  is  retrieved,  ReadingProcess  generates  an  event  with  the  read  byte.  Since  the 
read  and  write  commands  set  the  Flash  memory  in  different  modes  a  complex 
system  of  semaphores  and  flags  is  used  to  prevent  both  commands  occurring  at 
the  same  time. 

•  MODEB:  This  component  generates  the  appropriate  command  sequences  for  the 
Leica  sensor  that  sets  the  Leica  in  continuous  measurements  of  three 
accelerometers  and  three  magnetometers.  It  also  decodes  the  data  sent  by  the 
Leica  sensor. 

•  UARTPACKET :  It  receives  the  data  to  be  sent  back  and  forth  to  the  Leica  Sensor. 
It  receives  data  from  the  Leica  in  a  byte-by-byte  fashion,  strips  the  frame  from  its 
overhead  and  forwards  the  payload  data  to  the  MODEB  component.  It  also 
receives  the  frames  to  be  sent  to  the  Leica  and  transmits  them  byte  by  byte  to  the 
sensor  through  the  UART  component. 

•  UART:  It’s  the  component  that  manages  the  UART  hardware,  it  initializes  the 
hardware  and  receives  and  transmits  the  bytes  from  and  to  the  UART. 

The  structure  of  the  hardware  components  is  shown  next: 


Figure  4.1  Hardware  components  structure 

The  operation  of  the  master  is  as  follows.  The  master  waits  for  the  slave  to  signal  that  it 
has  finished  acquiring  sensor  data.  Then,  it  begins  to  search  for  the  mote  network,  in 
order  to  download  the  data.  The  search  is  done  by  means  of  the  ping  component  of  the 
middleware.  The  PING  component  sends  out  Hello  messages  until  a  network  mote 
answers  indicating  that  it  is  ready  to  receive  the  data  from  the  Tagmote.  Once  the  PING 
component  receives  such  an  acknowledgement,  it  generates  an  event  in  TAGMOTE  that 
in  return  issues  a  dump  command.  The  DUMP  component  requests  data  from  the  slave, 
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and  then  forwards  it  to  the  pinged  mote.  For  every  transmitted  frame,  the  receiving 
network  mote  replies  with  an  acknowledgement  message.  If  an  acknowledgement 
message  is  not  received,  the  DUMP  process  is  stopped  and  the  PING  process  is  started 
again  to  find  a  new  recipient  for  the  data.  The  download  process  is  complete  when  the 
slave  signals  it  has  no  more  data  to  send. 

The  Tagmote  code  has  several  working  versions.  The  older  versions  moved  gradually 
from  a  centralized  (one  processor)  implementation  that  was  simply  sending  raw  data  to 
the  network,  towards  a  distributed  (two  processor)  implementation,  in  which  data 
undergoes  some  real-time  processing,  including  a  data  compression  process.  Other 
technical  improvements  were  also  added  along  the  way. 

4.2.  NEST  Middleware 

The  NEST  middleware  consists  of  a  number  of  services  for  network  communication. 

Each  of  the  Tagmote,  RelayMote,  and  network  motes  uses  some  of  the  middleware 
services,  as  described  below. 

The  broker  service  builds  a  spanning  tree  in  which  the  relay  is  the  root  and  the  network 
motes  appear  as  nodes  or  leaves.  The  network  motes  use  this  service  in  order  to 
determine  the  next  hop,  when  they  forward  the  data  downloaded  by  the  Tagmote  towards 
the  relay.  The  broker  service  relies  on  the  synchronization  service.  The  broker  service  is 
used  by  the  network  motes  and  the  RelayMote. 

The  dump  service  is  the  name  of  the  Tagmote  service  that  sends  data  to  a  network  mote 
and  also  the  name  of  the  service  of  the  network  motes  that  stores  locally  the  data 
transmitted  by  the  Tagmote.  This  is  the  protocol:  The  Tagmote  sends  ping  signals 
periodically.  When  the  Tagmote  is  in  the  range  of  the  network  of  motes,  the  motes  that 
hear  the  ping  signals  reply.  Then,  the  Tagmote  establishes  the  connection  with  one  of  the 
network  motes  that  has  replied  and  dumps  data  to  that  mote  until  there  is  no  more  data  to 
send  or  no  acknowledgments  to  the  data  sent  are  received.  The  latter  may  happen  when  (i) 
the  Tagmote  position  has  changed  and  no  longer  allows  communication  with  the  network 
mote  or  (ii)  the  network  mote  has  filled  its  storage  space  with  data.  In  either  case,  the 
Tagmote  will  start  again  sending  ping  signals.  The  network  motes  store  the  data  dumped 
by  the  Tagmote  in  the  local  EEPROM. 

The  neighborhood  service  maintains  a  list  of  the  (active)  neighbors  a  mote  has.  Note 
that  mote  B  is  a  neighbor  of  mote  A  if  A  and  B  can  communicate  directly  (i.e.  single  hop). 
Note  that  the  operation  of  the  synchronization  service  relies  on  the  neighborhood  service. 
The  neighborhood  service  is  used  by  the  network  and  RelayMotes. 

The  ping  service  is  the  service  used  by  the  Tagmote  when  it  seeks  to  connect  to  the 
network.  The  same  name  is  used  to  denote  the  network  mote  service  that  replies  to  the 
ping  messages  sent  by  the  Tagmote.  The  operation  of  this  service  is  as  follows.  The 
Tagmote  sends  ping  messages.  If  a  network  mote  hears  a  ping  message,  it  replies  with  a 
message  that  includes  its  own  address.  Then,  if  the  Tagmote  hears  any  reply,  it  selects 
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one  of  the  replying  motes  as  destination  for  the  dump  service. 

The  publication  service  transmits  the  data  stored  in  the  network  motes  to  the  RelayMote. 
Each  mote  that  has  stored  data  dumped  by  the  Tagmote,  starts  (at  a  random  time) 
forwarding  data  packets  toward  the  RelayMote.  The  protocol  that  is  used  is  best  seen  in 
the  source  code  (PublishM.nc).  The  protocol  associates  a  buffer  to  each  mote.  The  buffer 
is  filled  either  from  the  EEPROM,  if  the  Tagmote  has  dumped  data  there,  or  from  data 
received  from  other  motes.  Note  that  data  is  progressively  sent  toward  to  the  RelayMote 
according  to  the  routes  described  in  the  spanning  tree  produced  by  the  broker  service.  A 
mote  X  starts  (at  a  random  time)  sending  the  data  from  its  buffer  toward  a  neighbor  mote 
Y  by  means  of  a  "set"  message.  Y  acknowledges  with  an  "ack"  message,  and  then  X 
starts  sending  data  packets  with  "dat"  messages.  Y  replies  with  "ack"  messages.  Either 
of  X  and  Y  can  clear  (close)  the  connection  with  a  "clr"  message.  There  are  also  locking 
and  timeout  mechanisms  that  ensure  the  proper  operation  of  the  protocol.  The  service 
achieves  about  400,  250,  and  100  packets/min  for  one,  two,  and  three  hops,  respectively. 
(The  throughput  goes  down  as  we  increase  the  number  of  hops  in  the  route.)  Note  that  the 
relay  version  of  the  service  forwards  to  the  base  station  the  packets  received  from  the 
publication  service  of  the  network  motes. 

The  route  service  provides  means  of  detecting  the  state  of  the  network.  Thus,  the 
network  motes  use  this  service  to  send  to  the  relay  data  packets  that  contain  the  local 
address  (i.e.  the  identity)  of  the  mote  and  the  local  addresses  of  its  neighbors.  The  relay 
forwards  the  packets  it  receives  to  the  base  station. 

The  synchronization  service  is  used  by  the  network  and  RelayMotes  to  create  a  global 
clock.  In  this  implementation,  the  clock  has  a  period  of  roughly  40sec.  The 
synchronization  service  is  implemented  together  with  the  neighborhood  service  in 
HoodM.nc.  Experimental  results  indicate  a  variance  of  10msec  in  event  synchronization. 

The  telemetry  service  is  useful  for  debugging  purposes.  It  implements  commands  that 
are  received  by  radio.  This  service  has  been  used  in  a  setup  involving  a  computer  running 
the  NestConsole  java  tool  that  is  also  connected  to  a  GenericBase  mote  (i.e.  a  mote  that 
runs  the  GenericBase  application  of  the  TinyOS  1.x  distribution).  In  this  setup,  the  user 
can  send  commands  to  the  network  motes  using  the  NestConsole  tool.  The  GenericBase 
mote  is  used  to  forward  these  commands  and  to  listen  to  the  network  messages. 

The  middleware  services  have  several  working  versions.  The  relay  and  network  mote 
version  that  can  handle  Tagmote  packets  with  compressed  data  is  called 
NestRelay  verC.l,  consisting  of  wNestMica2_ver2.8C  (the  extension  for  compressed 
data  of  the  2.8  middleware  version  of  the  network  motes)  and  wRelayMica2_ver2.1C  (the 
extension  for  compressed  data  of  the  2. 1  middleware  version  of  the  RelayMotes).  Note 
that  the  compression  version  and  the  previous  versions  are  incompatible:  the  compression 
version  of  the  middleware  cannot  handle  the  data  format  of  the  software  without 
compression,  and  the  version  of  the  middleware  for  uncompressed  data  cannot  handle 
compressed  data.  Thus,  the  latest  version  of  the  Tagmote  software  can  only  be  used  with 
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the  NestRelay  verC.  1  code,  and  the  previous  versions  of  the  Tagmote  software  cannot  be 
used  with  the  NestRelay_verC.  1  code. 

4.3.  Base  Station  Software 

The  base-station  software  consists  of  java  tools  and  the  executable  undif,  implementing  a 
C  program.  A  java  tool,  called  NestConsole  is  responsible  of  saving  the  data  packets 
received  from  the  RelayMote  in  a  file.  That  file  is  then  read  by  undif,  which 
decompresses  the  data.  Finally,  the  decompressed  data  is  processed  by  a  java  tool  for 
processing  and  displaying  the  data. 

Note  that  there  are  two  forms  of  the  NestConsole  tool.  One,  which  is  found  with  the 
network  mote  software,  is  meant  to  work  with  a  GenericBase  mote,  instead  of  a  relay. 

The  other  one,  found  with  the  RelayMote  software,  assumes  the  base  station  is  connected 
to  the  relay.  Both  NestConsole  tools  plot  also  the  status  of  the  network.  For  instance, 
links  are  used  to  display  the  motes  that  are  neighbors. 

There  are  also  several  versions  of  the  NestConsole  tool.  The  older  versions  have  been 
used  with  the  middleware  for  uncompressed  data.  The  latest  version  is  a  slight 
modification  that  allows  proper  interfacing  with  the  decompression  tool  undif. 

4.4.  Outline  of  System  Operation 

1.  From  the  Tagmote  to  the  network  motes: 

When  the  Tagmote  is  ready  to  dump  data,  it  pings  to  connect  to  the  network.  If  a  network 
mote  can  hear  the  ping  message  and  has  an  active  parent  to  which  it  can  transmit  the  data, 
it  will  respond  to  the  ping.  Then,  the  Tagmote  begins  to  dump  data  to  the  network  mote. 
When  the  data  buffer  of  the  network  mote  is  full,  the  Tagmote  looks  for  another  network 
mote  and  continues  downloading  its  data. 

2.  From  the  network  motes  to  the  relay: 

When  a  network  mote  has  data  to  transmit  to  the  relay,  it  chooses  its  parent  in  its  routing 
list  and  tries  to  transmit.  If  the  trial  fails  (no  acknowledgements  are  received),  the 
network  mote  will  change  to  another  parent  in  its  routing  list  until  it  is  successful.  Then 
the  network  mote  begins  transmitting  data  to  its  parent.  This  is  repeated  until  the  data 
reaches  the  relay.  Note  that  if  the  network  mote  loses  the  link  in  the  middle  of  a 
transmission,  it  has  the  ability  to  change  its  parent  and  set  up  a  new  path  to  communicate. 

4.5.  Files 

Note  that  the  software  code  developed  for  this  project  has  been  delivered  to  DARPA  and 
is  not  included  with  this  report. 

The  most  recent  software  is  included  in  the  following  files: 

o  The  middleware  for  the  relay  and  network  motes  as  well  as  the  java  tool 
NestConsole  and  undif  are  found  in  NestRelay_verCT.zip.  Included  are  also 
examples  of  installation  and  data  processing  shell  scripts  (inst  and  traj).  Note  that 


24 


the  middleware  requires  TinyOS  1.0.  Some  minor  updates  may  be  required  to 
compile  it  in  the  more  recent  versions  of  TinyOS. 
o  The  Tagmote  software  is  found  in  MR5. 1  .zip.  Note  that  it  consists  of  two 
components:  the  code  for  the  master  processor  (the  one  on  the  mica2)  and  the 
code  for  the  slave  processor  (the  one  on  the  mica  radio  board).  Note  also  that  the 
code  requires  TinyOS  1.1. 
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Section  5.  Display  and  GUI  Design 

This  section  explains  the  steps  taken  to  process  and  display  the  data  when  they  are 
received  by  the  Relay  mote. 

5.1.  GUI  Design  Using  Java 

The  first  attempt  was  to  create  an  applet,  so  that  the  application  could  be  executed  and 
monitored  online.  The  terrain  was  manually  drawn,  using  simple  graph  methods  (JAVA). 
The  points  were  being  read  from  a  file.  The  interface  was  as  shown  below. 
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The  second  step  was  to  create  a  stand  alone  program  that  would  use  the  canvas  and  frame 
routine,  so  that  it  would  be  platform  independent  and  would  be  executable  on  all 
machines.  The  program  was  first  reading  the  contents  of  a  text  file,  containing  3  columns, 
separated  by  commas,  that  is  x,  y,  time  and  depicting  them  on  a  map  (either  an  icon  or  a 
graph).  Later  on  the  code  was  changed  from  reading  a  text  file  to  listening  to  the  standard 
input  for  data  in  the  same  format.  At  this  point  the  code  was  also  made  more  independent, 
regarding  the  length  and  format  of  the  data,  the  only  restriction  was  that  the  3  columns 
should  be  separated  by  commas.  This  version  had  a  big  disadvantage.  If  we  wanted  to 
change  the  starting  point  or  the  scaling  factor  of  the  trajectory  (the  latter  is  associated 
with  the  K  factor,  which  depends  on  the  subject),  we  had  to  change  the  code  and 
recompile.  This  problem  was  solved  by  adding  text  boxes  on  the  interface,  two  for  the 
coordinates  of  the  starting  point,  one  for  the  scaling  factor  and  later  on,  two  for  the  xoff 
and  yoff  parameters  (to  adjust  the  Leica  module).  So  the  user  could  insert  the  appropriate 
values  in  the  boxes  and  redraw  the  trajectory  to  make  it  match  closer  the  terrain. 

The  final  interface  looks  as  follows: 
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5.2.  Implementation  of  DR  Algorithm  Using  Java 

After  the  Dead  Reckoning  (DR)  algorithm  was  developed  and  fully  tested,  the  data  that 
arrived  at  the  Relay  were  exfiltrated  and  the  algorithm  was  implemented  using  Java  for 
easy  system  integration. 

Since  Java  lacks  the  predefined  functions  for  matrix  and  vector  computation,  those 
functions  are  first  coded.  With  the  prototype  algorithm  in  Matlab,  it  was  relatively  easy  to 
debug  the  Java  program.  Using  the  same  raw  data,  the  results  from  the  Java  program 
matched  well  the  results  from  the  Matlab  program.  (Difference  was  less  than  0.001).  The 
Java  code  was  shown  to  be  robust  and  accurate.  This  Java  program  involves  two  versions: 
the  first  version  corresponded  to  our  initial  design,  the  board  where  the  raw  data  from  the 
sensors  were  used  as  our  input  data.  The  second  version  corresponded  to  a  later  design, 
when  we  had  added  the  preprocessing  capability  to  the  Tagmote.  At  the  Relay  we  were 
processing  only  the  min  and  max  data  points  extracted  by  the  preprocessing  algorithm. 
With  the  well-designed  class-structure  and  functions  of  the  first  version,  the  second 
version  was  only  a  minor  simplification  of  the  previous  version,  which  justified  the 
notion  of  software  reuse.  The  output  from  this  program  was  set  of  points  of  the  walked 
trajectory. 

5.3.  Visualization  of  the  Trajectory 

After  computing  the  trajectory  points,  the  next  step  involved  plotting  the  trajectory  for 
visualization.  The  terrain  JPG  file  was  loaded  as  the  background  and  the  trajectory  was 
plotted  against  this  background.  In  a  typical  situation,  initially  the  scaling  and  orientation 
of  our  trajectory  did  not  match  the  actual  path,  since  we  did  not  know  the  ratio  of  the 
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physical  distance  and  the  screen  distance.  This  problem  was  solved  by  introducing  a 
scaling  factor  that  could  be  easily  adjusted  on  the  screen,  so  that  a  good  match  could  be 
obtained.  Considering  the  compass  body  offset,  we  used  another  two  parameters  to 
adjust  the  offset  of  the  magnetic  readings.  After  the  1st  demo,  we  integrated  the  DR 
algorithm  and  visualization  as  one  program  instead  of  using  a  “pipe”  command  to  put 
them  together.  The  ability  to  use  the  mouse  to  choose  the  starting  point  was  also  added. 
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Section  1.  Executive  Summary 

Analytical  Modeling,  Performance  Evaluation,  and  Optimization  of  Large  Systems  of 
Networked  Embedded  Sensors 

Networked  Embedded  Wireless  Systems  have  a  dynamic  multihop  topology  composed  of 
power-  and  bandwidth-constrained  wireless  links.  The  topology  changes  with  time  as  the 
qualities  of  the  wireless  channels  vary  and  as  the  nodes  move  or  adjust  their  transmission 
and  reception  parameters;  it  cannot  be  determined  or  predicted  and  therefore  has  to  be 
modeled  as  random.  Therefore  our  objectives  of  this  theoretical  part  were 

(1)  to  model  the  topological  uncertainties  of  large  sensor  networks  and 

(2)  to  identify  solultions  to  cope  with  and  manage  the  inherent  non-determinism. 

Related  to  network  topologies,  we  derived  general  analytical  expressions  for  the 
intemode  distances  and  showed  that  nearest-neighbor  routing  schemes  perform  poorly  in 
terms  of  energy  consumption  and  throughput.  For  better  performance,  we  introduced 
quasi-regular  networks  with  significantly  smaller  distance  variance  and  much  improved 
coverage  properties.  At  the  link  level,  we  introduced  simple  yet  accurate  analytical  packet 
reception  models  that  incorporate  fading  as  a  small-scale  variation  in  the  received  signal 
power.  This  model  was  shown  to  predict  the  packet  reception  probabilities  observed  in 
real  networks  rather  well.  We  also  studied  the  throughput  performance  of  interference- 
limited  networks  and  extended  standard  "disk  graph"  models  to  graphs  with  non-isotropic 
connectivity.  Further  investigations  included  load  balancing  problems,  queueing-theoretic 
delay  analyses,  the  impact  of  power  amplifier  characteristics,  and  opportunistic  channel 
access  and  routing  schemes. 
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Section  2.  Description  of  Research  Results 


1  Motivation 

Due  to  the  inherent  non-determinism  and  computational  and  communicational  limitations,  net¬ 
works  of  embedded  systems  require  sound  mathematical  models  that  capture  the  probabilistic  and 
dynamic  character  of  these  networks,  including  the  random  node  distribution  and  the  unknown 
channel  gains  and  interferences.  Of  crucial  importance  is  the  ability  to  predict,  analyze,  and  bound 
performance  for  large  systems.  This  part  of  the  final  report  lists  our  contributions  to  this  problem. 


2  Research  Results 

The  emphasis  was  put  on  (1)  modeling  the  topological  uncertainties  of  sensor  networks  and  (2) 
finding  solutions  to  cope  with  and  manage  the  inherent  non-determinism.  At  the  topology  level, 
there  are  mainly  two  sources  of  uncertainty:  the  node  distribution  and  the  characteristics  of  the 
wireless  channel. 


2.1  Node  distribution  and  internode  distances 


Due  to  the  rapid  decay  of  signal  power  levels  with  distance  in  RF  communications,  uncertainty  in 
the  location  of  the  nodes  results  in  a  compounded  uncertainty  in  the  SINR.  Although  perfectly 
regular  lattices  may  be  useful  assumptions  to  derive  (upper)  performance  bounds,  in  practice,  the 
node  distribution  is  usually  subject  to  some  uncertainty. 


Poisson  random  networks.  A  well  accepted  model  for  the  node  distribution1  is  the  homoge¬ 
neous  Poisson  point  process  of  intensity  A,  where  the  number  of  nodes  in  an  area  A  is  Poisson 
distributed  with  parameter  AX  and  the  node  numbers  in  disjoint  areas  are  independent.  We  derived 
the  following  general  result  on  the  distances  in  Poisson  random  networks  [1]:  For  an  m-dimensional 
network,  the  distance  Rn  between  a  node  and  its  n-th  neighbor  has  the  probability  density  function 
(pdf) 


fRn(r) 


_  -A  Cmr^m(Ac^rm)n 
r(n  —  1)! 


(i) 


where  cmrm  is  the  volume  of  the  m-sphere  of  radius  r.  Assuming  a  path  loss  exponent  of  <a,  the 
(normalized)  mean  energy  consumption  E[i?^]  (for  n-th  nearest  neighbor  routing)  can  be  expressed 
in  closed  form  [2].  Rather  than  mean  values,  maxima  may  be  more  relevant,  in  particular  for  route 
or  network  lifetime.  For  a  nearest-neighbor  route  with  M  hops,  we  proved  that  the  expected 
maximum  energy  consumption  grows  at  least  as  (In  M)a/m.  This  reveals  one  of  the  problems  of 
nearest-neighbor  routing  in  Poisson  random  networks  and  indicates  that  other  routing  strategies 
or  node  distributions  are  preferable. 

Based  on  these  finding,  we  addressed  the  routing  problem  in  large  Poisson  random  networks 
in  [2]  and  compared  the  throughput  performance  of  regular  and  random  topologies  in  [3].  The 
main  conclusion  is  that  networks  with  random  node  distributions  suffer  from  a  substantially  smaller 
end-to-end  throughput. 


1In  particular,  if  nodes  move  around  randomly  and  independently,  or  if  sensor  nodes  are  deployed  from  an  airplane 
in  large  quantities. 
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Quasi-regular  sensor  networks.  Clearly,  more  regular  distribution  may  be  desirable.  In  [4] 
we  introduced  a  model  where  nodes  are  placed  approximately  in  a  regular  grid,  with  the  deviation 
from  the  ideal  grid  point  being  Gaussian  in  both  coordinates,  i.e.,  Ax,  Ay  ~  0,cr2),  such  that 

<j  controls  the  regularity  of  the  grid.  The  distance  of  a  node  from  its  grid  point  is  Rayleigh 
distributed  with  mean  cryTr/2,  and  the  nearest-neighbor  distance  is  Ricean.  The  same  quasi¬ 
regular  distribution  is  obtained  by  the  following  thinning  process:  Starting  with  a  Poisson  random 
network  of  intensity  A  >  1,  pick  for  every  integer  lattice  point  the  node  closest  to  it.  It  is  interesting 
to  note  that  the  distance  of  a  node  from  its  grid  point  is  Rayleigh,  as  it  is  for  the  Gaussian  quasi¬ 
regular  network.  This  strategy  balances  the  energy  consumption  and  the  coverage  by  turning 
Rayleigh  distances  into  Ricean  distances.  The  drawback  is  that  a  higher  number  of  nodes  is 
required  to  achieve  good  regularity.  On  the  other  hand,  by  slightly  shifting  the  lattice  from  time 
to  time,  a  lifetime  benefit  is  achievable. 


2.2  Wireless  channel  models 

The  purpose  of  a  channel  or  link  model  is  to  yield  a  packet  reception  probability  pr  given  the  link 
distance  d  and  the  other  relevant  network  parameters. 

A  link  model  with  fading.  Two  channel  or  link  models  are  prevalent:  the  disk  or  protocol 
model  and  the  so-called  physical  model  Both  are  entirely  deterministic  and  have  severe  shortcom¬ 
ings.  Since  most  networks,  exhibit  some  type  of  fading,  i.e.,  a  stochastic  variation  in  the  received 
signal  power  that  may  be  caused  by  multipath  propagation,  scattering,  or  obstruction,  we  intro¬ 
duced  a  tractable  yet  accurate  packet-level  link  model  in  [2].  This  model  was  incorporated  into 
the  “Prowler”  simulator  that  was  developed  at  Vanderbilt  University,  also  as  part  of  the  NEST 
project.  In  this  Rayleigh  fading  model,  the  packet  reception  probability  pr  —  E/[P[y  ^  ©  |  I]]  can 
be  factorized  into  the  reception  probability  of  a  zero-interference  network  p ^  and  the  reception 
probability  of  a  zero-noise  network  pTr  as  follows: 


pr  —  exp 


QN 

PodoQ 


i= 1 


(2) 


where  k  denotes  the  number  of  interferers,  do  is  the  distance  of  the  desired  transmission,  and  d{ 
the  distances  of  the  interferers. 

Interference  and  throughput.  Interference  is  critical  in  large-scale  sensor  networks,  since  it 
is  the  limiting  factor  for  the  scalability.  Except  for  the  case  where  the  interferers  all  have  the 
same  distance  from  the  receiver,  little  is  known  about  the  cumulated  interference.  We  inter-  and 
extrapolated  the  few  available  results,  characterizing  the  interference  and  the  local  throughput 
for  a  broad  class  of  networks,  specifically  one-  and  two-dimensional  networks  with  random  and 
deterministic  node  placement  (Poisson  or  deterministic/regular),  channel  access  (slotted  ALOHA 
and  TDMA) ,  and  channel  characteristics  (deterministic  and  Rayleigh  fading)  [5] .  Thus  we  captured 
up  to  three  sources  of  non-determinism:  the  node  positions,  the  channel  gains,  and  the  set  of 
simultaneously  transmitting  (interfering)  nodes  (dictated  by  the  MAC  scheme). 

Graphs  with  non-isotropic  connectivity.  It  was  conjectured  that  “the  conditions  for  con¬ 
nectivity  and  coverage  are  insensitive  to  the  shape  of  the  region  that  a  Poisson  point  covers” .  This 
conjecture  was  recently  disproved,  and  it  is  shown  that,  among  all  shapes  with  the  same  area,  the 
disk  is  the  last  to  lead  to  percolation,  i.e.,  the  disk  graph  needs  the  highest  expected  node  degree 
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to  percolate.  A  related,  rather  surprising  result  is  that  transmission  to  a  randomly  oriented  sector 
of  angle  S  and  radius  r  leads  to  percolation  as  soon  as  each  node  can,  on  average,  reach  1  +  o(l) 
neighbors.  This  result  suggests  the  use  of  directional  transceivers,  but  the  problem  of  isolated 
nodes  needs  to  be  avoided2,  since  the  gap  between  percolation  and  connectivity  is  accentuated. 
So,  the  source  and  destination  node  may  have  to  be  designed  to  be  more  powerful.  A  first  attempt 
to  exploit  the  result  is  the  transmission  strategy  that  we  (together  with  Bela  Bollobas  and  his 
collaborators)  proposed  in  [6]. 

2.3  Energy  and  delay  balancing 

In  most  applications  of  networked  embedded  systems,  traffic  is  routed  to  a  common  base  station 
or  access  point.  This  causes  traffic  accumulation  towards  the  base  station,  which  is  a  serious 
problem  affecting  the  lifetime  of  the  network.  In  [7]  we  studied  this  problem  and  suggested  different 
solutions. 

2.4  Long- hop  vs.  short-hop  routing 

It  is  a  non-trivial  problem  to  determine  the  optimum  hop  length  for  routing  in  large  sensor  net¬ 
works.  In  [2,  8,  9]  we  compared  routing  schemes  with  different  hop  lengths  and  concluded  that 
often,  long-hop  routing  offers  superior  performance. 

2.5  Queueing  analysis 

For  applications  requiring  QoS  guarantees,  a  detailed  queueing  analysis  is  needed  for  performance 
analysis.  While  prior  work  was  mostly  restricted  to  delay  means,  we  derived  the  end-to-end  delay 
distributions  [10-12]  for  certain  network  topologies. 

2.6  Power  amplifier  characteristics 

Often,  it  is  wrongfully  assumed  that  a  reduction  of  the  RF  transmit  power  reads  to  a  proportional 
reduction  in  the  actual  energy  consumption.  Since  this  is  not  true  for  practical  power  amplifiers, 
we  carried  out  a  detailed  analysis  in  [13]  that  showed  that  transmission  schemes  without  power 
control  often  offer  better  performance. 

2.7  Managing  uncertainty 

To  cope  with  the  uncertainties  in  large  systems  of  networked  embedded  sensors,  we  proposed  the 
following  approaches:  Redundancy,  Adaptivity,  Diversity,  Opportunism  (RADO).  We  investigated 
the  benefits  of  opportunistic  channel  access  in  [14]  and  proposed  a  simple  formalism  for  the  design 
and  analysis  of  time  and  path  diversity  schemes  in  [15]. 


2 With  an  average  out-degree  of  about  1,  an  arbitrary  node  cannot  reach  anybody  with  probability  e 


:  36%. 
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APPENDIX  A 

AUGUST  20,  2003  FIELD  EXPERIMENT 
REPORT  TO  DARPA 

Red  Force  Tagging  Status  Report 

Objective:  The  objective  of  the  Red  Force  Tagging  Field  Experiment  was  to  recover 
data  from  a  mobile  sensor  that  was  emplaced  upon  an  individual  and  then  use  that  data  to 
reconstruct  the  individual’s  trajectory.  Data  recovery  was  accomplished  using  a  network 
of  MICA2  motes  running  a  set  of  NEST  middleware  services  that  supported  large  data 
transfers.  In  order  to  achieve  this  objective,  the  Notre  Dame  team  developed  the  mobile 
sensor  (called  the  TagMote),  developed  the  middleware  services  supporting  data  transfers 
from  the  TagMote  to  network  motes  and  then  to  a  base  station  (Relay),  developed  the 
trajectory  reconstruction  algorithms,  and  integrated  all  of  these  components  into  a  single 
working  system. 

Description:  The  experiment  was  conducted  on  August  20,  2003  at  MacDill  Air  Force 
Base  (AFB).  The  experiment  had  a  randomly  chosen  subject  from  the  audience  wear  the 
TagMote  and  had  the  subject  take  a  three  minute  walk  in  the  vicinity  of  Lewis  Lake 
(MacDill  AFB).  The  subject  then  returned  to  the  NEST  mote  field  and  the  TagMote 
downloading  its  data  into  a  network  consisting  of  16  MICA2  motes.  The  downloaded 
data  was  stored  to  the  MICA2’s  flash  memory.  During  the  download  a  base  station 
listened  to  the  network  traffic,  picked  out  the  downloaded  data  packets  from  the  TagMote, 
and  stored  this  data  to  a  file.  After  the  TagMote’s  data  was  downloaded,  that  data  was 
then  exfiltrated  from  the  network  motes  to  a  Relay  connected  to  a  PC.  The  exfiltrated 
data  packets  were  then  written  to  a  file.  After  both  data  sets  were  recovered,  the  subject’s 
trajectory  was  reconstructed  from  the  downloaded  packets  and  the  exfiltrated  packets. 

Performance:  The  experiment  was  run  four  times  for  two  separate  groups  of  visitors. 

The  principal  performance  measures  were: 

1 .  Exfiltration  Performance  Measure  (Recovery  Rate): 

The  number  of  exfiltrated  data  lines  /  number  of  downloaded  data  lines 

2.  Download  Performance  Measure  (Time,  Data  Rate): 

Download  Time,  number  of  downloaded  data  bytes  /  second 

3.  Exfiltration  Performance  Measure  (Time,  Data  Rate): 

Exfiltration  Time,  number  of  exfiltrated  data  bytes  /  second. 

4.  Trajectory  Reconstruction  Accuracy: 

Plot  of  reconstructed  trajectories  viewed  against  aerial  map  of  region. 

The  table  shows  the  observed  performance  metrics  for  the  4  trials  conducted  during  the 
experiment.  The  table  shows  that  we  were  successful  in  recovering  99%  of  the 
TagMote’s  Trajectory  data,  with  download  times  on  the  order  of  3  minutes  /  3  minute 
walk  (70  data  bytes  /  second)  and  exfiltration  times  on  the  order  of  1  minute  /  3  minute 
walk  (200  data  bytes  /  second). 
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Recovery  rate 
exfiled/download 

Download 
Time  (sec) 

Download 

Throughput 

(bytes/sec) 

Exfiltration 

Time 

(seconds) 

Exfiltration 

Throughput 

(bytes/sec) 

Trial  1 

99.9% 

200 

84 

69 

242 

Trial  2 

103.7% 

189 

72 

111 

127 

Trial  3 

100.0% 

189 

79 

67 

222 

Trial  4 

99.9% 

184 

63 

55 

213 

Average 

190.5 

74.5 

75.5 

201 

Trial  2  exhibited  some  interesting  things.  First  the  recovery  rate  (ratio  of  exfiltrated 
versus  downloaded  packets)  was  greater  than  100  percent.  This  is  because  the  base 
station  listening  to  the  download  was  unable  to  hear  all  of  the  downloaded  packets.  This 
discrepancy  between  downloaded  and  exfiltrated  data  is  shown  in  the  reconstructed 
trajectories  in  the  appendix.  The  reconstruction  from  the  exfiled  data  in  trial  2  exhibits  a 
small  loop  at  the  upper  end  of  the  walk.  This  loop  is  not  present  in  the  reconstruction 
made  from  the  download  packets  overheard  by  our  base  station.  Clearly  the  packets 
associated  with  this  loop  are  the  packets  that  our  base  station  did  not  overhear.  The  other 
interesting  thing  to  note  is  that  the  exfiltration  time  for  trial  2  was  longer  than  for  the 
other  runs.  This  is  because  the  majority  of  the  data  in  trial  2  was  exfiltrated  over  a  two- 
hop,  rather  than  a  one-hop  connection. 

The  reconstructed  trajectories  had  an  accuracy  that  was  consistent  with  earlier  predictions 
of  20  meters  over  1  kilometer  of  distance  traveled.  The  following  picture  shows  the 
aerial  view  of  the  last  trial’s  trajectory  reconstructed  from  the  exfiltrated  data.  In  this 
particular  field  run,  the  subject  walked  north  along  the  road  bordering  Lewis  Lake  and 
then  spelled  out  US  during  his  walk.  As  can  be  seen  from  the  attached  figure,  the 
reconstructed  trajectories  clearly  show  this  feature  in  the  subject’s  walk.  On  the  basis  of 
these  results,  we  felt  the  field  experiment  successfully  met  its  objective.  Pictures  of  the 
reconstructed  trajectories  for  the  other  3  trials  are  attached  to  the  end  of  this  report. 
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Caveats:  In  order  to  facilitate  a  smoothly  flowing  demonstration,  several  simplifications 
were  adopted  in  the  experiment.  These  simplifications  and  the  justification  for  using 
them  are  itemized  below. 

1.  The  subject’s  walk  was  restricted  to  3  minutes.  This  was  done  so  the  audience 
didn’t  need  to  wait  for  a  long  period  of  time  before  seeing  the  reconstruction.  The 
actual  TagMote  is  capable  of  storing  2  hours  of  walking  data. 

2.  The  subject  was  requested  to  stand  still  during  the  downloading  process.  This 
was  also  done  to  keep  the  length  of  the  experiment  reasonable.  While  we  have 
done  numerous  experiments  with  the  subject  moving  through  the  field  during  a 
download,  we  have  usually  found  that  this  can  dramatically  lengthen  the 
download  time.  The  precise  increase  in  duration  is  difficult  to  predict  and 
depends  greatly  upon  the  subject’s  movements  and  environmental  conditions. 

3.  Network  size  was  restricted  to  16-25  motes.  This  was  done  to  reduce  the  length 
of  the  exfdtration  time.  For  larger  networks,  we  would  expect  a  preponderance  of 
multi-hop  routes  to  form.  As  mentioned  above,  the  exfiltration  rate  for  a  single 
hop  is  200  data  bytes/second.  This  time  increases  linearly  with  the  number  of 
hops  in  the  route  from  the  NestMote  to  the  Relay.  So  we  restricted  our  network 
size  to  ensure  a  preponderance  of  single  and  two-hop  routes.  This  simplification 
greatly  reduced  the  length  of  time  the  audience  had  to  wait  for  the  exfiltrated  data. 

4.  The  experiment  did  not  use  a  localization  service.  Localization  in  the  NEST 
middleware  is  a  difficult  problem  and  we  simply  assumed  that  it  was  already  in 
place.  In  practice,  we  would  also  want  to  implement  some  sort  of  localization 
service. 

5.  Trajectory  Reconstruction  was  not  completely  automated.  We  viewed  this  as  a 
“feature”,  rather  than  a  simplification.  Trajectory  reconstruction  quality  is 
governed  by  a  number  of  algorithm  parameters  that  are  dependent  on  the  subject 
and  the  site  location.  In  practice,  it  is  usually  impossible  to  ensure  that  all  of  these 
parameters  are  known  before  hand.  We  therefore  designed  our  trajectory 
reconstruction  tool  so  a  user  could  interactively  adjust  the  reconstruction 
parameters  to  force  the  trajectory  to  match  known  waypoints  that  the  subject 
passed. 

Many  of  the  preceding  simplifications  were  adopted  to  enhance  the  flow  of  the  field 
experiment  for  the  audience.  We  felt  that  none  of  these  simplifications  fundamentally 
restricted  the  experiment’s  scope. 


Reconstructed  Trajectories 

The  following  pages  show  the  reconstructed  trajectories  from  the  4  field  experiments. 
Experiment  1  (walk  from  UVA  to  ND’s  site) 
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.  -r. ‘41 


Reconstruction  from 
downloaded  data 
1046  packets  /  200  sec 


Notre  Dame’s 
Staging  Area 


Subject  spells 
out  ND 


Reconstruction  from 

exfiltrated  data 

1045  packets  /  69  seconds 


Experiment  2  (subject’s  walk  north  of  ND’s  site) 


Subject  spells 
out  letter  B 


Notre  Dame  s 
Staging  Area 


Reconstruction  from 
downloaded  data 
851  packets  / 189  sec 


Experiment  3  (walk  from  UVA’s  site  to  ND’s  site) 
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Experiment  4  (walk  north  of  ND’s  site) 


Reconstruction  from 
downloaded  data 
734  packets  / 184  sec 


Subject  spells 
out  US 


Notre  Dame’s 
Staging  Area 


Reconstruction  from 

exfiltrated  data 

733  packets  /  55  seconds 
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APPENDIX  B 

Experimental  Results  (June  2004) 


The  Tagmote  code  has  several  working  versions.  The  older  versions  moved  gradually 
from  a  centralized  (one  processor)  implementation  that  was  simply  sending  raw  data  to 
the  network,  towards  a  distributed  (two  processor)  implementation,  in  which  data 
undergoes  some  real-time  processing,  including  a  data  compression  process.  For  a 
detailed  discussion,  see  Sections  4  &  5. 


Tagmote  Design 
Innovations 

Download  Time 
(%  of  walking  time*) 

Exfiltration  Time 
(One-hop  Case) 

One  Processor 
(Aug.03  Demo) 

106% 

31% 

Preproc.  on  master  after 
flash  storage 

156% 

31% 

Preproc.  on  slave  after  flash 
storage 

89% 

31% 

Preproc.  on  slave  before 
flash  storage 

44% 

31% 

As  before  with  differential 
compression 

28% 

17% 

*:  The  total  walking  time  with  2M  bytes  flash  is  up  to  10  hours.  In  fact,  the 
sustainable  walking  time  is  limited  by  the  battery  capacity. 

Table  B.l  Experimental  Results  Comparison  among  Different  Tagmote  Versions 
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1.  INTRODUCTION 

The  ability  to  track  the  position  of  the  user  is  an  essen¬ 
tial  part  of  many  applications  [7,  4,  2].  It  is  well  known 
that  Global  Positioning  System  (GPS)  is  limited  as  a  navi¬ 
gation  aid  by  its  inability  to  provide  static  heading  and  its 
lack  of  availability  when  used  around  obstructions  (terrain 
or  man-made)  or  in  the  presence  of  jamming.  Therefore, 
it  is  necessary  to  develop  a  positioning  system  which  can 
complement  GPS  in  GPS-compromised  areas,  which  is  not 
a  trivial  task. 

With  advances  in  computation,  communication  and  sens¬ 
ing  capabilities,  large  scale  sensor-based  distributed  envi¬ 
ronments  are  emerging  as  a  predominant  mobile  computing 
infrastructure  [1] .  Hundreds  or  thousands  of  small,  inexpen¬ 
sive  and  low-power  sensors,  such  as  Berkeley  Motes  [5],  can 
be  quickly  deployed  to  monitor  a  vast  field.  In  this  work  we 
make  use  of  this  mobile  infrastructure  and  new  product  in¬ 
novations  to  build  a  pedestrian  tracking  system  (PTS)  that 
is  capable  of  working  either  indoors  or  outdoors.  Fig.  1  illus¬ 
trates  the  system  architecture.  The  system  involves  a  Dead 
Reckoning  (DR)  module,  self-organizing  wireless  networks 
(sensor  mote  network)  and  a  map  database.  Dead  reckon¬ 
ing  is  used  by  DR  modules  to  determine  their  positions  and 
trajectories,  and  the  sensor  mote  network  is  used  to  collect 
trajectory  data  and  make  them  available  at  a  base  station 
for  further  processing.  The  DR  module  consists  of  the  Leica 
Geosystems  DMC-SX  three  axes  accelerometer  and  mag¬ 
netic  compass  (Leica  Vectronix  AG  [6])  combined  with  a 
generic  wireless  controller  board,  with  the  radio,  the  pro¬ 
cessing  and  the  power  storage  all  integrated.  The  controller 
board  for  DR  module  and  network  nodes  both  use  the  open- 
source  Berkeley  Motes  with  the  Tiny  OS.  In  the  following,  we 
call  the  DR  module  unit  with  the  generic  board,  NavMote , 
and  a  network  node,  NetMote.  The  NetMotes  play  a  critical 
role  in  NavMote  calibration,  trajectory  data  collection  and 
exfiltration,  all  via  wireless  communication  links.  The  tra¬ 
jectory  data  collected  by  NetMotes  can  be  further  exfiltrated 
to  an  information  center  for  displaying,  map  matching,  and 
other  purposes. 
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personal  or  classroom  use  is  granted  without  fee  provided  that  copies  are 
not  made  or  distributed  for  profit  or  commercial  advantage  and  that  copies 
bear  this  notice  and  the  full  citation  on  the  first  page.  To  copy  otherwise,  to 
republish,  to  post  on  servers  or  to  redistribute  to  lists,  requires  prior  specific 
permission  and/or  a  fee. 
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Figure  1:  The  pedestrian  tracking  system  architec¬ 
ture 

There  are  two  challenges  for  the  PTS:  On  the  NavMote 
side,  the  resource-poor  mote  prevents  using  any  computa¬ 
tionally  expensive  algorithms,  while  size,  weight  and  cost 
constraints  need  to  be  met.  On  the  network  side,  a  large 
amount  of  data  need  to  be  downloaded  from  the  NavMote 
to  one  or  more  of  (unreliable)  NetMotes  and  forwarded  to 
Relay  Mote  with  error-prone  wireless  channels.  In  this  pa¬ 
per,  we  focus  on  the  design  challenges  of  the  NetMote  net¬ 
work.  (For  additional  information  regarding  hardware,  sen¬ 
sors,  and  DR  algorithm,  see  [3].) 

The  present  application  differentiates  itself  from  the  tra¬ 
ditional  sensor  network  applications  in  several  ways.  First, 
large  and  sudden  bursts  of  data  must  be  delivered  to  the  base 
station  with  a  low  latency.  The  transport  of  event  impulses 
is  likely  to  lead  to  varying  degree  of  channel  contention  and 
network  congestion.  Second,  these  trajectory  data  are  not 
redundant,  requiring  a  near  zero  loss-rate.  Third,  but  not 
last,  the  wireless  links  are  highly  unpredictable  and  unre¬ 
liable,  and  bidirectional  connectivity  cannot  generally  be 
achieved.  Commonly  observed  in  the  experiments  are  iso¬ 
lated  nodes/clusters,  leading  to  extended  latency  and  even 
network  deadlocks.  To  address  these  problems,  we  imple¬ 
ment  an  IEEE  802.11  DCF  (RTS/CTS/ACK)  scheme  to 
avoid  channel  contention,  while  guaranteeing  a  low-loss  rate 
delivery  with  aggregated  message  ACKs.  In  our  self-healing 
adaptive  routing  scheme,  every  node  maintains  a  set  of  next- 
hop  candidates.  The  “best”  candidate  is  selected  based  on 
throughput  estimate.  Various  deadlock  cases  due  to  missed 
messages  are  identified  and  countermeasures  are  taken  to 
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eliminates  these  cases. 

2.  THE  SYSTEM  DESCRIPTION 
2.1  Overview 

A  typical  scenario  involves  the  reconstruction  of  the  space/ 
time  path  a  subject  has  taken  from  an  initial  to  a  final  lo¬ 
cation.  While  the  NavMote  carried  by  the  subject  is  out 
of  range  of  the  network,  it  uses  the  flash  memory  to  store 
the  accelerometer  and  compass  data  in  real-time.  Custom 
algorithms  provide  compensation  for  adverse  sensor  orienta¬ 
tion  and  calibration,  and  for  collection  with  possible  coding 
of  data.  When  the  subject  enters  again  into  radio  range  of 
the  network,  the  data  captured  by  NavMote  are  extracted 
automatically  via  wireless  communication  and  deposited  in 
the  NetMotes’s  EEPROM.  These  data  that  are  distributed 
within  the  network  are  then  via  multi-hop  transmissions  sent 
to  a  designated  special  “relay”  or  “destination”  mote  ( Relay- 
Mote )  for  further  processing.  At  this  stage,  we  assume  the 
initial  location  is  known  and  localization  service  is  available 
to  provide  NetMote  positions. 

In  what  follows,  we  discuss  the  technical  consideration 
that  underpin  the  design  of  the  NetMote  network  while  the 
detailed  design  is  presented  in  the  next  subsection. 

The  media  access  control  plays  a  significant  role  in  the 
performance  of  managing  bursts  of  data  in  a  wireless  shared 
medium.  A  number  of  sensor  networks  use  CSMA  or  vari¬ 
ants  for  medium  access.  For  example,  the  TinyOS  plat¬ 
form  uses  a  simple  CSMA  MAC.  CSMA  suffers  from  the 
well-known  hidden  terminal  problem  in  multihop  environ¬ 
ment.  IEEE  802.11  utilizes  an  RTS/CTS  exchange  to  elim¬ 
inate  hidden  terminals.  Although  the  signaling  cost  of  the 
RTS/CTS  exchange  is  high  for  sensor  networks  where  the 
packet  size  is  small,  the  scheme  is  justified  for  the  PTS  ap¬ 
plication  due  to  the  burst  traffic  characteristics;  for  a  three- 
minute  walk,  NavMote  has  approximately  12  KBytes  (com¬ 
pressed)  data,  which  are  often  downloaded  to  several  Net- 
Motes.  Every  NetMote  then  tries  to  transmit  the  data  in 
its  buffer  of  size  128  x  16  Bytes  in  one  RTS/CTS  exchange. 
Hence  the  number  of  RTS/CTS  exchanges  is  quite  small 
compared  to  the  data  size.  Random  delays  are  also  intro¬ 
duced  in  addition  to  backoff  as  suggested  in  [9].  To  further 
reduce  energy  cost,  ACK  is  used  in  an  aggregated  manner, 
not  for  every  single  packet. 

In  the  course  of  network  operation,  NetMote  and  link  fail¬ 
ures  may  lead  to  arbitrary  and  unsupport  topology.  The 
RelayMote  is  required  to  use  the  active  topology  for  extrac¬ 
tion  of  information  from  the  network  and  distribution  of 
control  packets  to  the  NetMotes.  The  algorithm  which  we 
have  used  to  satisfy  the  requirements  of  reliable  routing  is  a 
distributed  leader  election  algorithm.  The  algorithm  is  used 
to  construct  a  minimum-hop  spanning  tree,  such  that  each 
NetMote  knows  a  “parent  node”  closer  to  the  RelayMote.  At 
each  hop,  the  NetMote  receives  a  packet  and  retransmits  it 
to  upper  level.  As  a  preventive  measure,  a  periodic  recompu¬ 
tation  of  minimum  spanning  is  triggered  by  the  RelayMote 
to  account  for  any  changes  in  the  topology.  Furthermore, 
all  the  NetMotes  keep  the  number  of  transmission  retries 
as  the  throughput  estimate.  The  best  parent  node,  which 
has  least  number  of  retries,  is  selected.  Max  retried  times  is 
also  set  to  prevent  from  forwarding  data  to  inactive  or  dead 
NetMotes.  If  there  is  no  “good”  parent  available,  then  the 
NetMote  holds  data  till  the  next  spanning  tree  update.  The 


watchdog  function  is  another  essential  part  of  the  scheme  to 
ensure  a  robust  routing  protocol.  For  instance,  silent  motes 
caused  by  CTS  can  be  unlocked  automatically  even  in  case 
of  missing  messages  or  software  task  deadlines. 

2.2  NetMote  Network  Middleware  Services 

The  software  supporting  the  PTS  application  is  written 
using  TinyOS/NesC  [8].  The  software  is  organized  into  a 
stack  consisting  of  three  layers;  the  application,  middleware 
and  operating  system  or  O/S  layers.  There  are  three  pri¬ 
mary  application  services :  Coordinator.  This  service  man¬ 
ages  the  interaction  of  the  other  application  services;  Dump : 
This  service  downloads  trajectory  data  from  the  NavMote 
to  the  NetMote’s  EEPROM;  Exfile :  This  service  streams 
data  in  a  NetMote’s  EEPROM  to  the  network’s  RelayMote 
(base  station). 

There  are  five  primary  middleware  services :  Ping :  Nav¬ 
Mote  uses  this  service  to  determine  if  it  is  in  the  vicinity  of 
a  NetMote;  Backbone :  This  service  builds  and  maintains  a 
robust  minimum  hop  spanning  tree  from  all  NetMotes  to  the 
RelayMote;  Clock  Synchronization :  This  service  maintains  a 
global  clock  variable  across  all  nodes  in  the  network;  Local¬ 
ization:  This  service  initializes  and  maintains  a  variable  rep¬ 
resenting  the  NetMote’s  physical  position;  Telemetry :  This 
service  periodically  sends  packets  down  the  network’s  back¬ 
bone  to  the  RelayMote.  The  telemetry  packets  contain  in¬ 
formation  about  the  network’s  current  configuration.  This 
service  is  used  by  the  RelayMote  to  build  a  picture  of  the 
entire  network. 

The  O/S  services  are  software  components  interfacing  di¬ 
rectly  to  the  mote’s  physical  resources  such  as  the  UART, 
radio,  sensors,  clock,  random  number  generator,  and  EEP¬ 
ROM  (logger). 

The  services  in  the  middleware  layer  are  responsible  for 
setting  up  and  maintaining  the  network  infrastructure  re¬ 
quired  to  recover  data  from  the  NavMote  in  a  flexible  and 
reliable  fashion.  As  soon  as  a  NetMote  is  reset,  it  starts 
up  these  middleware  services  in  a  specific  order.  The  Clock 
Synchronization  and  Localization  services  are  started  first 
in  order  to  initialize  the  mote’s  clock  and  location  vari¬ 
ables.  Once  these  services  have  stabilized,  a  signal  is  issued 
which  starts  the  Backbone  service.  This  service  automati¬ 
cally  builds  a  minimum  hop  spanning  tree  from  all  NetMotes 
to  the  RelayMote.  The  service  is  designed  to  detect  changes 
in  link  quality  that  would  adversely  effect  the  network  con¬ 
nectivity.  Upon  detecting  such  changes,  the  backbone  ser¬ 
vice  reconfigures  its  spanning  tree  to  restore  network  con¬ 
nectivity.  Finally,  once  the  backbone  service  has  stabilized, 
it  issues  a  signal  which  starts  the  telemetry  service.  The 
telemetry  service  simply  sends  packets  to  the  RelayMote 
that  provide  the  user  with  a  global  view  of  the  network’s 
connectivity. 

The  NavMote  uses  the  Ping  service  to  find  the  network. 
Upon  finding  the  network,  it  uses  the  application  layer’s 
Dump  service  to  download  the  trajectory  data  to  the  net¬ 
work.  The  dump  service  works  in  an  opportunistic  manner, 
dumping  as  much  data  as  possible  to  the  nearest  available 
NetMote’s  EEPROM.  The  download  then  switches  to  an¬ 
other  download  should  the  original  dump  stream  be  inter¬ 
rupted.  Having  multiple  NetMotes  store  the  NavMotes  data 
is  desirable  since  a  single  NetMote  may  not  have  enough 
EEPROM  to  store  all  of  the  trajectory  data  and  the  subject 
may  be  moving  from  being  close  to  one  NetMote  to  being 
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closer  to  a  different  NetMote. 

The  Exfile  service  is  used  to  transmit  data  logged  in  the 
NetMote’s  EEPROM  to  the  RelayMote.  The  coordinator  on 
the  NetMote  ensures  that  the  Dump  and  Exfile  service  oper¬ 
ate  in  a  mutually  exclusive  manner.  In  this  way,  the  Exfile 
service  does  not  begin  until  the  download  from  the  NavMote 
has  stopped.  Upon  starting,  the  Exfile  service  begins  read¬ 
ing  data  stored  in  EEPROM  and  storing  it  in  a  128  line  (16 
Bytes/line)  buffer.  Once  this  buffer  is  full,  the  service  issues 
a  request-to-send  (RTS)  to  the  next-hop  on  the  backbone’s 
spanning  tree.  All  nodes  (except  the  sender  and  next-hop) 
hearing  the  RTS  are  immediately  set  to  a  dormant  state, 
thereby  establishing  a  basic  service  cell  (BSC)  in  which  the 
sender  and  next-hop  can  communicate  with  little  interfer¬ 
ence  from  other  NetMotes.  Upon  receiving  a  clear-to-send 
(CTS)  from  the  next-hop,  the  Exfile  service  begins  transmit¬ 
ting  the  buffered  data,  line  by  line,  to  the  next-hop.  Every 
four  transmitted  lines  is  acknowledged  (ACK)  by  the  next- 
hop.  Should  the  sender  miss  an  ACK  or  the  next-hop  miss 
an  expected  message,  then  they  issue  a  clear  (CLR)  message 
that  resets  the  basic  service  cell  so  it  is  ready  for  the  next 
RTS  message. 

The  Exfile  service  essentially  streams  data  from  all  Net- 
Motes  with  logged  data  to  the  RelayMote.  The  trajectory 
data  transmitted  by  the  Exfile  service  follows  the  same  span¬ 
ning  tree  used  by  the  Telemetry  service  packets.  The  fi¬ 
nal  destination  for  both  message  streams  is  the  RelayMote. 
Upon  being  caught  by  the  RelayMote,  the  packets  are  for¬ 
warded  over  the  RelayMote’s  UART  to  a  PC  that  is  run¬ 
ning  a  Java  graphical  user  interface  (GUI)  called  NetCon- 
sole.  The  GUI  displays  telemetry  packets  that  are  forwarded 
to  the  RelayMote  from  the  network.  The  data  contained  in 
these  packets  allow  the  GUI  to  display  the  NetMote  and 
NavMote  position  and  neighborhoods.  The  data  also  al¬ 
low  the  GUI  to  display  the  routes  and  connectivity  between 
NetMotes.  Finally,  the  NetConsole  dumps  the  decoded  data 
packets  to  a  file  for  subsequent  post-processing. 

3.  CONCLUSIONS 

In  this  paper,  we  describe  the  development  of  a  pedes¬ 
trian  tracking  system  that  uses  three-axes  accelerometer  and 
magnetic  compass  (Leica  Geosystems)  and  a  mote  network 
consisting  of  Berkeley  motes. 

The  system  was  examined  in  both  indoor  and  outdoor  en¬ 
vironments.  One  group  of  experiments  took  place  in  swamp 
terrain.  The  subjects  took  5-minute  or  3-minute  walks  through 
the  test  course.  The  principal  performance  measures  are: 

1.  Exfiltration  Performance  Measure  (Rec  Rate):  The 
number  of  exfiltrated  data  lines  /  number  of  down¬ 
loaded  data  lines. 

2.  Download  Performance  Measure  (DTime,  DTput):  Down¬ 
load  Time,  number  of  downloaded  data  bytes/sec. 

3.  Exfiltration  Performance  Measure  (ETime,  ETput): 
Exfiltration  Time,  number  of  exfiltrated  data  bytes/sec. 

4.  Trajectory  Reconstruction  Accuracy:  Plot  of  recon¬ 
structed  trajectories  viewed  against  aerial  map  of  re¬ 
gion. 

Table  1  shows  the  observed  performance  metrics  for  the 
4  trials  conducted  during  the  test.  A  total  of  21  NetMotes 
are  deployed  in  a  3  x  7  grid,  with  a  grid  size  around  7  feet. 
The  table  shows  that  we  were  successful  in  recovering  99% 


Rec 

Rate 

DTime 

(sec) 

DTput 

(b/s) 

ETime 

(sec) 

ETput 

(b/s) 

1 

99.9% 

200 

84 

69 

242 

2 

100.0% 

189 

72 

111 

127 

3 

100.0% 

189 

79 

67 

222 

4 

99.9% 

184 

63 

55 

213 

Avg. 

190.5 

74.5 

75.5 

201 

Table  1:  System  Performance 


of  the  NavMote’s  trajectory  data,  with  download  times  on 
the  order  of  3min/3min  walk  (70  data  bytes/sec)  and  ex¬ 
filtration  times  on  the  order  of  lmin/3min  walk  (200  data 
bytes/sec).  Extensive  experiments  show  that  the  accuracy 
of  walk  distance  estimate  is  with  ±3%  and  heading  accuracy 
d=l° . 

Our  experiences  tell  us  that  sensor  nodes  and  wireless 
links  are  much  unpredictable  and  unreliable  than  one  might 
think.  Hence  a  simple  yet  robust  MAC  scheme/routing  pro¬ 
tocol  are  critical  to  any  sensor  network  application.  We  fo¬ 
cus  more  on  the  high  throughput  and  low  loss-rate  aspects, 
without  fully  addressing  the  energy-aware  network  opera¬ 
tion,  which  is  in  some  sense  more  important  for  the  sen¬ 
sor  network  applications.  Other  practical  challenges  include 
node  localization,  network  security,  privacy,  and  authenti¬ 
cation.  We  believe  however  that  our  development  offers  a 
vision  of  future  mobile  environments  that  may  emerge  once 
ubiquitous  wireless  coverage  becomes  available. 
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ABSTRACT 

Overload  management  policies  avoid  network  congestion  by 
actively  dropping  packets.  This  paper  studies  the  effect 
that  such  data  dropouts  have  on  the  performance  of  spa¬ 
tially  distributed  control  systems.  We  formally  relate  the 
spatially-distributed  system’s  performance  (as  measured  by 
the  average  output  signal  power)  to  the  data  dropout  rate. 
This  relationship  is  used  to  pose  an  optimization  problem 
whose  solution  is  a  Markov  chain  characterizing  a  dropout 
process  that  maximizes  control  system  performance  subject 
to  a  specified  lower  bound  on  the  dropout  rate.  We  then 
use  this  Markov  chain  to  formulate  an  overload  management 
policy  that  enables  nodes  to  enforce  the  ’’optimal”  dropout 
process  identified  in  our  optimization  problem.  Simulation 
experiments  are  used  to  verify  the  paper’s  claims. 
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1.  INTRODUCTION 

A  sens  or- actuator  network  is  a  collection  of  sensors  and 
actuators  that  exchange  information  over  a  shared  commu¬ 
nication  network.  Such  networks  can  be  used  in  active  struc¬ 
tural  acoustic  control  [3]  or  other  spatially  distributed  con¬ 
trol  systems  [1]  [2].  An  example  of  such  a  system  is  shown 
in  Figure  1.  This  system  [1]  is  a  linear  array  of  masses  that 
are  linked  together  by  springs.  Each  mass  (node)  has  an  em¬ 
bedded  processor  attached  to  it  that  can  measure  the  node’s 
local  state  (position)  and  then  transmit  that  information  to 
its  neighboring  nodes.  The  712th  node’s  position  at  time 
7ii  is  denoted  as  xni,n2 •  This  transmitted  information  is 
then  used  by  the  embedded  processor  to  compute  a  control 
input.  The  control  input  to  the  712th  node  at  time  71 1  is 
denoted  as  unijn2.  Distributed  controllers  for  such  systems 
can  be  designed  using  a  host  of  methods.  Most  of  these  con¬ 
trollers,  however,  assume  there  is  no  loss  of  data  through 
the  network.  This  paper  examines  the  effect  dropouts  have 
on  overall  control  system  performance  and  devises  policies 
for  optimally  managing  these  dropouts. 


Communication  network 


Figure  1:  Distributed  Control  System  and  Sensor- 
Actuator  Network 

Data  dropouts  occur  for  many  reasons,  but  this  paper  con¬ 
fines  its  attention  to  networks  in  which  1)  data  are  dropped 
due  to  link  failures  or  2)  data  are  purposefully  dropped  in 
order  to  stay  within  a  specified  throughput  allocation  where 
throughput  is  defined  to  be  the  average  number  of  feedback 
measurements  per  second.  The  first  source  of  data  dropouts 
cannot  be  controlled  by  the  nodes  and  we  assume  that  such 
dropouts  occur  at  a  rate  /  that  is  statistically  independent 
of  the  data  source.  This  model  for  link  failure  may  be  ap¬ 
propriate  for  networks  in  which  small  sets  of  nodes  share  the 
same  physical  medium.  Examples  of  such  networks  include 
RF  (radio)  wireless  networks  as  well  as  wired  networks  in 
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which  a  node  is  connected  by  wire  to  it  nearest  neighbors. 
In  such  networks,  the  most  likely  cause  for  link  failure  arises 
from  collisions  between  packets  in  the  medium. 

The  second  source  of  dropouts  occurs  because  overload 
management  policies  require  nodes  to  purposefully  drop  pack¬ 
ets  to  prevent  network  congestion.  In  particular,  the  net¬ 
work  allocates  a  portion  of  its  available  throughput  to  each 
node.  Each  node  then  selectively  drops  packets  to  stay 
within  this  throughput  allocation.  Previous  work  in  over¬ 
load  management  [9]  has  focused  on  the  use  of  heuristic 
policies  such  as  the  (m,  /c)-firm  guarantee  rule.  This  paper 
provides  an  alternative  approach  to  overload  management 
that  has  provable  guarantees  on  the  attainable  level  of  ap¬ 
plication  performance. 

In  this  paper,  “application  performance”  refers  to  the  per¬ 
formance  delivered  by  a  spatially  distributed  control  system. 
Figure  2  illustrates  a  control  system  that  consists  of  a  con¬ 
troller  K(z)  and  plant  G(z)  that  are  connected  in  a  feedback 
loop.  In  our  case,  the  feedback  loop  is  implemented  over  a 
communication  network.  The  input  to  the  control  system 
is  a  signal,  ic,  that  represents  either  an  exogeneous  noise  or 
disturbance  that  is  exciting  the  plant.  The  controller  K(z) 
minimizes  the  impact  that  this  disturbance  w  has  on  the 
system’s  output,  y.  In  particular,  we  seek  to  minimize  the 
power  in  this  output  signal,  y.  We  sometimes  refer  to  this 
as  a  generalized  regulator  problem. 


W 


_  _  L(z)_  _  _ 


TiH 

K(z) 

G(z)  - 

T  : 

network 

◄ - 

y 

Figure  2:  Generalized  Regulator  over  a  Network 

The  control  system  shown  in  Figure  2  is  an  abstract  model 
of  the  physical  system  shown  in  Figure  1.  For  the  physical 
system,  the  plant  is  the  structure  consisting  of  masses  and 
springs.  The  controller  is  implemented  in  the  embedded  pro¬ 
cessors  generating  the  input  signals  driving  the  individual 
masses.  The  network  is  the  communication  channel  between 
the  embedded  processors.  Essentially  we  can  view  this  as  an 
acoustic  vibration  suppression  problem.  The  external  signal 
ic  is  a  noise  source  that  is  exciting  the  physical  structure. 
The  embedded  processors  measure  the  resulting  vibration 
in  the  structure  and  use  these  measurements  to  generate  a 
control  that  tries  to  reduce  that  vibration.  So  the  measure 
of  this  control  system’s  performance  is  the  average  power  in 
the  physical  structure’s  state  signal. 

This  paper  examines  the  impact  that  communication  net¬ 
work  Quality-of- Service  (as  measured  by  feedback  measure¬ 
ment  throughput)  has  on  the  control  system’s  performance 
(as  measured  by  the  power  in  the  structure’s  vibration).  In 


particular,  traditional  control  system  design  assumes  that 
feedback  measurements  are  being  received  by  the  controller 
at  regular  time  intervals.  Requiring  that  all  feedback  mea¬ 
surements  are  successfully  transmitted  over  the  network  re¬ 
sults  in  a  hard  real-time  control  system.  Meeting  hard  real¬ 
time  deadlines  is  extremely  difficult  in  communication  net¬ 
works  where  several  embedded  processors  share  the  same 
communication  channel.  This  paper  breaks  the  tyranny  of 
periodic  hard  real-time  scheduling  by  asking  and  answering 
the  following  questions.  Can  we  characterize  the  control  sys¬ 
tem’s  performance  as  a  function  of  the  average  throughput 
rate?  Is  it  possible  to  distribute  dropped  feedback  measure¬ 
ments  in  a  way  that  has  the  least  impact  on  application 
(control  system)  performance?  The  answer  to  both  ques¬ 
tions  is  in  the  affirmative  and  this  paper  shows  how  those 
answers  apply  in  sensor-actuator  networks  used  to  control 
the  spatially  distributed  system  shown  in  Figure  1. 

The  remainder  of  this  paper  is  organized  as  follows.  Sec¬ 
tion  2  presents  the  spatially  distributed  system  studied  in 
this  paper.  Section  3  discusses  prior  work  relating  control 
system  performance  and  data  dropouts.  This  prior  work 
only  pertains  to  single  control  loops,  so  section  4  extends 
that  work  to  the  spatially  distributed  system  in  figure  1. 
Section  5  uses  these  results  to  formulate  an  ’’optimal”  over¬ 
load  management  policy  and  the  behavior  of  this  policy  is 
demonstrated  on  a  simulation  of  a  27-node  structure.  Final 
remarks  are  found  in  section  6.  The  theorems’  proofs  will 
be  found  in  section  7. 


2.  SPATIALLY  DISTRIBUTED  CONTROL 
SYSTEM 

This  paper  confines  its  attention  to  the  spatially  distributed 
system  shown  in  Figure  1.  We  first  assume  that  the  continuous- 
state  dynamics  of  the  system  have  been  discretized  in  time. 

So  we  let  xni,n2  denote  the  state  of  the  712th  node  at  time 
instant  71 1 .  The  state  x  is  a  2- vector  characterizing  the  posi¬ 
tion  and  velocity  of  the  node  with  respect  to  its  equilibrium 
position.  The  discrete  state  satisfies  the  following  recursive 
equations, 

Xn\-\-l,ri2  —  AXn\,n2  T  ^  (^ni  ,712  —  1  T  3?ni,ri2  +  l) 

-\-F(uni:n2  yJni,n2^) 

Zni,ri2  —  Cxni  :n2  (i) 

for  m  >  0  and  any  712-  zni,n2  1S  an  output  signal  that  is  used 
to  characterize  overall  system  performance.  A,  B,  C  and  F 
are  appropriately  dimensioned  real- valued  matrices.  There 
are  two  inputs  to  this  equation;  the  disturbance  wni,n2  and 
the  control  unijn2.  The  disturbance  is  a  zero- mean  white 
noise  process  in  both  time  and  space.  The  control  input  is 
computed  by  the  embedded  processor. 

The  examples  in  this  paper  confine  their  attention  to  1- 
dimensional  structures  in  which  there  is  only  a  single  spatial- 
variable,  772-  Note  that  this  work  may  be  easily  extended  to 
2-d  chains  through  the  introduction  of  an  additional  spatial 
variable.  Detailed  models  for  such  systems  will  be  found  [2] 

Each  node  has  a  processor  attached  to  it.  The  processor 
measures  the  node’s  local  state  xnijn2  and  it  transmits  this 
information  to  its  neighbors  upon  request.  We  assume  that 
the  nodes  are  synchronized  in  time  and  that  in  each  sam¬ 
pling  interval  the  node  decides  whether  or  not  to  access  its 
neighbor’s  state.  This  means  that  a  node  first  ’’requests” 
that  its  neighbors  send  data  to  it  and  then  the  processor 
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computes  its  control  input  unijn2  upon  receiving  this  data. 
If  neighboring  state  information  has  been  received,  then  the 
control  input  is  computed  according  to  the  following  equa¬ 
tion, 


The  control  objective  is  to  minimize  the  noise  power  in  the 
node’s  state.  So  a  natural  measure  of  application  perfor¬ 
mance  is  the  average  power,  ||z||p,  in  the  node’s  output  sig¬ 
nal  znijn2.  This  power  is  usually  written  as 


—  Kq  Xn 


T  K\{Xn 


-1  T  Xni  ,n2  +  l) 


(2) 


where  Kq  and  K\  represent  control  gain  matrices  that  have 
been  chosen  by  the  control  engineer.  Since  our  network  may 
occasionally  drop  packets,  the  processor  needs  to  use  a  dif¬ 
ferent  control  signal  if  the  neighboring  state  data  is  not  re¬ 
ceived.  In  this  case,  the  processor  simply  sets  uni,n2  —  0. 

Data  will  always  be  dropped  by  the  network  for  two  rea¬ 
sons.  The  first  reason  is  that  the  medium  is  unreliable.  A 
transmitted  packet  has  a  finite  probability  /  of  being  lost 
due  to  link  failure.  This  probability  is  assumed  to  be  sta¬ 
tistically  independent  from  the  state  of  the  packet’s  source. 
Dropouts  will  also  occur  because  a  node  explicitly  decides 
NOT  to  request  neighboring  state  measurements.  This  oc¬ 
curs  because  an  overload  management  policy  requires  nodes 
to  drop  a  certain  percentage  of  packets  when  the  network  is 
congested.  In  particular,  the  network  allocates  a  specified 
amount  of  its  throughput  to  each  node  which  we  character¬ 
ized  as  a  lower  bound,  £d,  on  the  node’s  actual  dropout  rate. 
The  size  of  £d  depends  on  the  amount  of  network  congestion. 
Overload  management  through  packet  dropping  clearly  has 
an  adverse  impact  on  overall  application  performance.  This 
is  particularly  true  for  hard  real-time  feedback  control  sys¬ 
tems.  This  paper  determines  an  overload  management  pol¬ 
icy  that  is  ’’optimal”  in  that  it  maximizes  application  (con¬ 
trol  system)  performance  while  ensuring  the  dropout  rate 
does  not  drop  below  Ed- 

Because  dropouts  cause  us  to  switch  between  two  different 
control  laws,  the  system’s  state  space  model  takes  the  form 
of  a  jump  linear  system  [7].  In  particular,  let’s  define  a 
dropout  process  that  is  denoted  as  dni,n2-  It  is  a  binary 
random  process  in  which  dni,n2  =  1  if  a  dropout  occurs  and 
is  zero  otherwise.  Under  the  dropout  process,  our  system 
equations  take  the  form, 

Xni  +  l,n2  —  An-^  ,n2  Xn\ ,n2  H-  ^i,n2  (^ni,n  2  — 1  4~~ 
Xni,n2  +  l)  +  T IX n i  ,n2 

Zni,n2  Cxnijn2  (3) 


where  Ani,n2  and  Bniin2  are  matrix  valued  random  pro¬ 
cesses  such  that 


A 


n i  ,n2 


B 


n i  ,n2 


B1 


A  +  FK0 

if  no  dropouts  occur 
(i.e.,  dni,n2  —  0) 

A 

if  a  dropout  occurs 

(z.e.,  dni  ?n2  —  1) 

B  +  FKi 

if  no  dropouts  occur 
(i.e. ,  dni  jTi2  —  0) 

B 

if  a  dropout  occurs 
(i.e.,dni,n2  —  1) 

Application  performance  will  be  measured  by  the  aver¬ 
age  power  in  the  control  system’s  output  signal.  This  is  a 
standard  measure  of  performance  for  regulation  problems. 
In  our  case,  we  want  to  suppress  the  effect  that  the  distur¬ 
bance  Wn\  ,n2  has  on  the  system’s  shape.  In  particular  we 
assume  that  w  is  a  white  noise  process  whose  covariance 
matrix  is 


B  —  ^2 


w 


T 

n  i  ,n2 


} 


||z||p  =  TraceE {zni,„2z^lin2}  =  Trace  [C'P0C'TJ 

where  P o  is  the  covariance  matrix 

Po  E{xni  ,n2  xni  n2 }  (4) 

Note  that  throughout  this  paper  we  are  assuming  that  all 
nodes  in  the  system  are  ’’identical”,  so  that  the  above  co- 
variance  matrix  is  independent  of  m  and  712.  Our  problem 
is  to  find  a  way  of  evaluating  Po  as  a  function  of  the  dropout 
process,  dni,n2- 

Throughout  this  paper,  the  dropout  process  dni,n2  will 
be  generated  by  an  N- state  Markov  chain  with  states  q± 
through  <pv,  transition  probability  matrix  Q  =  [qij]Nx N, 
and  stationary  distribution  tv  =  [7Ti,  7r2,  •  •  •  ,  7Tjv].  Each  node 
of  our  system  will  instantiate  a  copy  of  this  Markov  chain. 
The  state  of  the  Markov  chain  at  node  712  at  time  ni  will  be 
denoted  as  qni,n2 •  The  Markov  chain  generates  the  dropout 
process  through  the  function  h  that  maps  each  Markov  state, 
qi ,  onto  either  0  or  1.  The  dropout  process  is  then  defined  by 
the  equation  dni,n2  —  h(qni,n2)-  We  can  therefore  see  that 
the  matrix- valued  processes  Ani,n2  and  Bn 1>n2  will  take  val¬ 
ues  based  on  the  value  of  qni,n2  •  In  particular,  if  qni,n2  —  qi , 
then  we’ll  denote  Ani,n2  and  Bn 1;U2  as  Ai  and  Pj,  respec¬ 
tively. 

3.  PRIOR  WORK 

There  is  a  small  amount  of  work  studying  the  effect  of 
dropouts  on  the  performance  of  networked  control  systems. 
Nearly  all  of  this  work  has  confined  its  attention  to  single 
control  loops,  rather  than  the  multiple  coupled  control  loops 
found  in  this  paper’s  application.  Early  work  in  [8]  treated 
the  dropout  process  as  a  Markov  chain  and  developed  ad  hoc 
schemes  for  dealing  with  dropouts.  In  [12],  the  system  with 
dropouts  was  treated  as  an  asynchronous  switched  system 
and  switched  system  analysis  methods  were  used  to  assess 
overall  system  stability.  Much  of  the  earlier  work  focused 
on  system  stability.  More  recently,  there  have  been  papers 
examining  system  performance  as  a  function  of  the  dropout 
process.  In  [4],  this  was  done  for  a  simple  networked  control 
system  in  which  dropped  measurements  were  replaced  by 
zeros.  In  [10],  system  performance  was  measured  by  its  7ioo 
gain  and  this  gain  was  evaluated  as  a  function  of  packet  loss. 
Similar  results  were  obtained  in  [5]  and  [6]  for  networked 
control  systems  in  which  the  dropout  process  was  modelled 
as  a  Markov  chain. 

The  results  in  [10]  and  [6]  are  of  particular  importance 
to  this  paper  because  they  provide  formal  relationships  be¬ 
tween  system  performance  and  dropout  rates.  This  relation¬ 
ship  was  used  [6]  to  pose  an  optimization  problem  whose 
solution  (if  it  exists)  is  a  dropout  process  maximizing  con¬ 
trol  system  performance  (average  output  power)  subject  to 
a  lower  bound  on  the  average  dropout  rate.  The  top  plot 
in  Figure  3  shows  results  from  this  prior  work.  This  plot 
compares  the  performance  of  the  system  under  three  differ¬ 
ent  stochastic  dropout  policies;  optimal,  i.i.d,  and  a  ’’soft” 
(777,  k)- firm  guarantee  rule  [9]  .  In  this  example  the  system 
is  closed  loop  stable  and  open  loop  unstable  (see  [6]  for  de¬ 
tails).  The  Figure  plots  the  average  output  power  as  a  func- 
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tion  of  the  allowed  transmission  rate  (i.e.  1  minus  the  aver¬ 
age  dropout  rate).  As  the  transmission  rate  decreases,  the 
average  power  increases  (worse  performance)  and  asymptot¬ 
ically  approaches  infinity.  The  results  show  that  an  overload 
management  policy  enforcing  the  optimal  dropout  process 
performs  better  (lower  average  output  signal  power  over  a 
wider  range  of  dropout  rates)  than  policies  enforcing  the  i.i.d 
or  (ra,  &)-firm  guarantee  dropouts  processes. 


Figure  3:  Performance  of  Various  Overload  Man¬ 
agement  Policies 

The  results  in  Figure  3  are  particularly  interesting  because 
the  behavior  of  the  optimal  dropout  process  runs  counter  to 
prevailing  intuition.  The  bottom  pictures  in  Figure  3  are 
the  state  diagrams  for  the  optimal  and  ’’soft”  (2,  3)  dropout 
processes.  The  state  of  each  Markov  chain  is  denoted  as 
dm  —  i  dm ,  so  that  the  state  marked  ”11”  means  that  the  last 
two  packets  were  dropped.  Note  that  the  optimal  dropout 
process  requires  that  if  a  single  dropout  occurs,  we  must 
immediately  drop  the  second  packet  as  well.  The  soft  (2,3) 
rule,  however,  requires  that  only  one  packet  be  dropped  in  a 
row.  The  results  show  that  for  this  particular  system,  it  is  al¬ 
ways  better  to  drop  two  packets  than  one.  This  runs  directly 
counter  to  the  prevailing  wisdom  embodied  in  the  (m,k)~ 
hrm  guarantee  rule.  In  the  (m,  /c)-hrm  guarantee  rule,  we 
require  that  m  out  of  k  consecutive  packets  get  through 
the  network.  Experimental  results  in  [9]  showed  that  over¬ 
load  management  policies  enforcing  this  rule  perform  well 


on  some  specific  examples.  This  prior  work,  however,  pro¬ 
vides  little  formal  analysis  to  suggest  that  this  is  necessarily 
the  best  policy  to  follow  for  all  control  systems.  The  re¬ 
sults  in  figure  3  show  that  ’’soft”  variations  on  the  (m,k) 
heuristic  may  sometimes  perform  poorly  and  show  that  it  is 
indeed  possible  to  derive  overload  management  policies  that 
optimally  utilize  the  throughput  allocated  to  the  controller. 

Results  of  this  type  are  clearly  relevant  to  overload  man¬ 
agement,  for  they  provide  a  systematic  method  by  which 
optimal  management  policies  might  be  formulated.  The  re¬ 
sults  in  [10]  and  [6],  however,  are  not  directly  applicable  to 
the  system  shown  in  Figure  1  because  that  work  only  per¬ 
tains  to  single  control  loop  whose  feedback  is  implemented 
over  a  network.  In  systems  controlled  by  sensor- actuator 
networks,  there  are  a  large  number  of  control  loops  that 
are  coupled  through  their  environment.  This  is  precisely 
the  situation  encountered  in  Figure  1.  In  our  system,  every 
node  has  an  embedded  processor  that  implements  a  local 
controller.  The  controller’s  actions  are  determined  by  the 
neighboring  node  states  and  those  states  are  in  turn  influ¬ 
enced  by  the  local  controller  through  physical  interactions 
in  the  environment.  If  we  are  to  develop  optimal  overload 
management  policies  we  will  need  to  extend  that  prior  work 
to  this  particular  class  of  spatially  distributed  systems. 

4.  PERFORMANCE  OF 

SPATIALLY  DISTRIBUTED  SYSTEMS 

This  section  states  two  new  results  concerning  the  per¬ 
formance  of  the  distributed  system  in  Figure  1.  Theorem 

4.1  characterizes  the  average  output  power  P o  for  the  dis¬ 
tributed  system  without  control  (i.e.,  uni,n2  —  0)-  Theorem 

4.2  extends  theorem  4.1  to  the  jump  linear  systems  found 
in  equation  3.  This  section  simply  states  the  theorems  and 
comments  on  their  significance.  Formal  proofs  of  the  theo¬ 
rems  will  be  found  in  section  7. 

The  first  theorem  characterizes  Po  for  a  non-switching 
spatially  distributed  control  system.  In  particular,  it  states 
that  if  the  system  is  stable  in  the  mean  square  sense  (i.e. 
^‘{xn1,n2x'ni,n2  }  <  oo),  then  Po  is  obtained  from  the  solu¬ 
tion  of  an  infinite  system  of  linear  equations. 

Theorem  4.1.  Letxniin2  satisfies  equation  1  without  con¬ 
trol  input  (i.e.,  uni,n2  =  0)  where  wni,n2  is  a  zero  mean 
white  noise  process  with  covariance  matrix  R. 

IfE{Xn1^n2xnijn2}  <  oo  (i.e.  stability  in  the  mean  square 
sense)  ,  then  P o  (see  Eq.  f)  is  obtained  by  solving  the  fol¬ 
lowing  system  of  equations. 

Po  =  APoAt  +  2BP0Bt  +  A(P!+  Pi)Bt  (5) 
+B(P1+PT1)AT  +  B(P2  +  pI)Bt  +  R 
Pi  =  AlPiAt  +  A(P2  +  Po)BT  +  B(P2  +  Po)AT 
+B(pf  +  2Pi  +P3)BT 
Pk  =  APkA  +  P(Pfc  — 2  +  2P  k  +  Pk+2)B 

+A(P  fc+i  +  P  k—i)BT  +  B(P  fc+i  +  P  k-i)AT 
where  the  last  equation  applies  for  k  >  2. 

Remark:  The  assumption  of  stability  is  essential  in  this 
theorem.  This  paper  does  not  determine  sufficient  condi¬ 
tions  for  the  system  to  be  stable  in  the  mean-square  sense, 
but  related  results  have  been  obtained  in  [5]  for  single  net¬ 
worked  control  loops.  Similar  stability  conditions  may  be 
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applicable  for  this  distributed  systems,  but  we  have  not  in¬ 
cluded  such  results  in  this  paper  due  to  space  limitations. 

Equations  5  is  an  infinite  system  of  equations  in  which 
you  must  solve  for  the  matrices  Pk  for  k  —  0, . . . ,  oo.  P o 
was  defined  in  equation  4.  The  other  matrices  represent  the 
expectation 

Pk  j7t,2  ,n2  —  k }  (6) 

Pk  =  P-k 

Note  that  because  all  nodes  are  identical,  this  system  is  shift 
invariant  with  respect  to  both  m  and  n 2.  This  is  why  we 
can  drop  any  explicit  mention  of  m  and  712  in  equation  6. 

Remark:  The  infinite  set  of  equations  (Eq.  5)  may  be 
solved  numerically  in  a  recursive  manner.  In  particular,  we 
generate  a  sequence,  {P  k[L]}^=0 ,  of  matrices  that  converge 
to  the  true  Pk  as  L  goes  to  infinity.  The  recursion  we  used 
is 

Po[L+l]  =  AP0[L]At +  2BP0[L]Bt +  R  (7) 

+A(P1[L]  +  P1[L}t)Bt  +  B(P1[L] 
+P1[L]t)At  +  B(P2[L}+P2[L]t)Bt 

Pi[L+1]  =  AP1[L]At +  A(P2[L\+P0[L])Bt 

+B(P2[L} +  Po[L])At 
+B(P f  [L\  +  2P0[L\  +  P3[L])Bt 

Pk[L  + 1]  =  APk[L]AT +  A(Pk+1[L]+Pk-1[L]BT 

+B(Pk+1[L]+Pk-1[L\)AT 
+B(Pk-2[L ]  +  2  Pk[L\  +  Pk+2[L])BT 

where  we  let  Po[0]  =  R  and  Pk[ 0]  =  0  for  k  ^  0.  The  re¬ 
cursion  is  terminated  when  ||Po[T  +  1]  —  Po[L]  ||  is  less  than 
a  specified  error  tolerance.  This  recursion  is  convergent  for 
this  particular  example.  It  is  still  an  open  question,  how¬ 
ever,  under  what  general  conditions  the  proposed  recursion 
is  convergent. 

We  extend  Theorem  4.1  to  spatially-distributed  systems 
with  dropouts.  Such  systems  are  modelled  as  jump  linear 
systems  characterized  by  equation  3.  This  theorem  provides 
an  infinite  set  of  equations  that  can  be  used  to  solve  for  the 
covariance  matrix, 

Po  E{xni  ,n2  xni  n2  } 

Once  again  because  of  spatial  shift- invariance,  Po  is  inde¬ 
pendent  of  n 2.  The  proof  for  this  theorem  will  be  found  in 
section  7. 

Theorem  4.2.  Let  icni,n2  be  a  zero-mean  white  noise  pro¬ 
cess  with  covariance  R.  Let  xni,n2  satisfy  the  jump  linear 
system  equation  given  in  equations  3  which  is  driven  by  a 
Markov  chain  with  transition  matrix  Q  =  [qij]NxN  with  sta¬ 
tionary  distribution  tv  =  [7Ti,  7T2,  •  •  •  ,  7Tjv]  • 

If  Ei{xnljn2Xni,n2}  <  °o  (i.e.  mean  square  stability)  , 
then 

n 

Po  ,n2  xn^  n2  }  ^  ^  Pq 


where  PqZ  satisfy  the  following  infinite  set  of  equations. 

Po’*  =  AiS^Af  +  t ViBiiflSd'  +  S'p  +  Sp2)Bl  (8) 
+MSP  +  +  BiiS?  +  S'l\)Aj  +  t nR 

P\j  W  Bi(Pjl i  +  7 TjS?  +  TViTVjS-p  +  7 TiSt)Bj 
+7 VjMSP  +  SP)Bj  +  AiS[JAj 
+7 ViBi^Sfj  +  S0,j)Aj 

Pk°  =  AiS^Aj  +  t TiizjBiQSz  +  S-£_2  +  S^2)Bj 

+7t jAi(S]j+1  +  SkL^Bj  +  7ViBi(S^J+1  +  S^f^Aj 

for  k  >2,  where  : 

.  .  (  EN=i  Qii9rnjPkm  ,  when  k^O 

$k  ~  ^  J2iLi  QnPd  >  when  k=0  and  i=j 

I  0  ,  when  k=0  and  i  ^  j 

N  N  N 

qP  —  C-,J  _  V"  q"’-7'  q->-  —  qi,j 

3  =  1  *=1  *0  =  1 

Remark:  Note  that  there  are  strong  similarities  between 
the  equations  8  of  theorem  4.2  and  equations  5  of  theorem 
4.1. 

Remark:  Equations  8  is  an  infinite  set  of  linear  equations 
that  we  solve  for  the  matrices  P%k  •  In  particular,  these 
matrices  are  the  following  conditional  expectations. 


nj  = 

TViTV jPi\Xn1,n2Xn1,n2  —  k  \  —  l,n2  —  Qi: 

qni_ljri2_k  =  qj}  for  k  ^  0 

K’1  = 

7ViT}j-{xnijn2Xn1,n2  1  Qn1—l,n2  =  Qi } 

Ploj  = 

0  for  i  ^  j 

II 

ST 

(pif 

We  can  again 

use  a  recursion  similar  to  that  shown  in 

tion  7  to  solve  these  equations. 

5.  OPTIMAL  OVERLOAD  MANAGEMENT 
POLICIES 

Theorem  4.2  presents  a  method  for  computing  the  covari¬ 
ance,  Po,  of  a  node’s  state  vector  as  a  function  of  the  dropout 
process’  transition  matrix,  Q.  Since  we  take  Trace (CPoCT) 
to  be  a  measure  of  the  local  controller’s  performance,  these 
results  provide  the  required  extension  of  [6]  to  spatially  dis¬ 
tributed  control  systems.  We  can  now  use  this  result  to 
formulate  and  solve  a  problem  to  find  the  Markov  chain 
that  minimizes  Trace (CPqCt)  subject  to  a  lower  bound  £d 
on  the  chain’s  average  dropout  rate.  The  resulting  Markov 
chain  will  then  be  used  to  devise  an  ’’optimal”  overload  man¬ 
agement  policy. 

To  state  our  problem,  we  need  to  consider  a  specific  Markov 
chain.  In  particular,  let’s  assume  the  Markov  chain  has  four 
states  defined  as  follows: 

qi  |  dni  —  i,n2  —  0,  dnij77,2  —  0 

q2  |  dni  —  i,n2  —  0,  dnij77,2  —  1 

Q3  |  dni—l,ri2  —  1  ?  dn^  ;n2  —  0 

Q4  |  dm—l,ri2  —  1  ?  dn^  ;7t,2  —  1 

When  qni,n2  —  Qi:  then  the  probability  that  the  next  packet 
will  be  dropped  on  purpose  is  e*.  In  other  words  £i  is  the 
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probability  that  the  node  decided  NOT  to  request  data  from 
its  neighbors.  The  total  dropout  probability  (including  the 
effect  of  link  failures)  will  be  1  —  (1  —  £i)(l  —  /)  where  /  is 
the  link  failure  rate.  With  these  notational  conventions,  the 
probability  transition  matrix  for  the  Markov  chain  becomes, 


£\ 

£2 

0 

0 

0 

£3 

£5 

e6 

0 

0 

0 

£7 

ei  =  (l-ei)(l-/) 

e3  =  (l-e2)(l  -/) 
es  =  (1  —  e3)(l  —  /) 
e-j  =  (l-e4)(l  -/) 


0 


£4 

0 

e8 

£2 

=  £1 

+ 

/( 1- 

~£i) 

£4 

=  £2 

+ 

/(I  - 

-£2) 

£6 

=  £3 

+ 

/(l  - 

-S3) 

£ 8 

=  £4 

+ 

/(l- 

-  £4) 

The  steady  state  distribution  is  denoted  as  tv  =  [tv\,  7r2, 713,714] . 

The  average  dropout  rate ,  e  is  given  by  the  following  equa¬ 
tion 


4 

£  =  y^TTiEi. 
i= 1 


As  discussed  earlier,  the  network  will  limit  a  node’s  trans¬ 
mission  rate  in  order  to  avoid  congestion.  This  limitation 
takes  the  form  of  a  lower  bound  £d  on  the  node’s  actual 
dropout  rate.  In  other  words,  the  network  requires  that  the 
node  drops  packets  at  least  as  fast  as  £d-  The  overload  man¬ 
agement  policy  used  by  the  node  must  assure  that  £  >  £d 
and  that  the  dropped  packets  degrade  overall  control  sys¬ 
tem  performance  as  little  as  possible.  So  the  optimization 
problem  we  seek  to  solve  has  the  following  formal  statement, 

minimize:  Trace  (X^=1  CPq1Ct ) 

with  respect  to:  £1 ,  £2 ,  £3 ,  £4 

subject  to:  £  >  £d  (9) 

7T  =  7 rQ 

where  Pq1  is  computed  using  the  equations  in  theorem  4.2 
and  £d  is  a  specified  constant  (i.e.  a  measure  of  the  through¬ 
put  allocated  to  the  local  controller). 

We  considered  a  specific  system  to  verify  the  correctness 
of  our  theoretical  results.  This  example  is  a  distributed  sys¬ 
tem  in  which  local  controllers  switch  between  open-loop  and 
closed-loop  configurations.  A  closed-loop  configuration  oc¬ 
curs  when  dni,n2  =  0.  This  happens  when  the  Markov  chain 
state,  qni,n2 >  equals  qi  or  ^3.  An  open- loop  configuration  oc¬ 
curs  when  a  dropout  occurs  (i.e.  d[n]  =  1).  This  happens 
when  the  Markov  chain’s  state  is  either  #2  or  ^4.  The  system 
matrices  used  in  this  experiment  were, 


A  = 


0.9990  0.0100 

-0.1999  0.9990  ’ 


0.0005  0 
0.1000  0  ’ 


c  = 


1.0  0 
0  0.1 


B. 


When  there  is  no  dropout,  the  control  uni,n2  takes  the  form 

Uni,n2  —  K%ni  ,n2  %ni  ,n2  —  1  %ni  ,n2  +  1 


where  K  — 


-93.2580  -10.4700 

0  0 
closed-loop  system  therefore  become 


The  dynamics  of  the 
+l,n2  —  (A  +  BK)Xm,n2  T  BWni,n2i 


From  this  equation  we  see  that  the  closed-loop  distributed 
system  is  really  a  group  of  decoupled  subsystems.  When  a 
dropout  occurs  there  is  no  control  (i.e.  uni,n2  =  0)  and  the 
physical  coupling  between  subsystems  reasserts  itself.  So 
the  system  matrices  Ai  and  Bi  that  are  switched  to  when 
the  Markov  chain’s  state  is  qi  are  given  as  follows: 

Ax  =  A3  =A  +  BK 

B\  =  B3  =  0 

A2  =  A4  =  A 

B2  =  B4  =  B 

For  our  particular  problem,  Matlab’s  optimization  tool¬ 
box  was  used  to  numerically  solve  the  preceding  optimiza¬ 
tion  problem.  In  particular,  we  used  the  Matlab  function 
f  mincon  after  a  suitable  initial  condition  was  identified.  These 
optimizations  were  done  assuming  a  link  failure  rate,  /,  of 
0.3  for  £d  between  0.05  and  0.85.  The  solutions  are  the 
transition  probabilities  We  found  that  these  probabili¬ 
ties  took  the  following  form, 

'  xu  0,0,0  £d  <  0.2237 

r  1  _  1, 0,  x3, 0  0.2237  <  £d  <  0.4117 

[ei ,  £2,  £3,  S4j  -  <  lja.2)1;0  0.4117  <  £d  <  0.5833 

1, 1, 1,  X4  0.5833  <  £d 

where  x\,X2,xs,X4  are  determined  by  the  average  dropout 
rate  condition. 

The  plot  in  Figure  4  compares  the  performance  under  the 
’’optimal”  dropout  policy  (solid  line)  and  an  i.i.d.  (indepen¬ 
dently  identically  distributed)  dropout  policy  (dashed  line). 
This  is  the  performance  computed  using  our  theoretical  re¬ 
sults.  We  plot  the  log  power  level  (log(TraceE  [CPoC])  as 
a  function  of  the  transmission  rate  1  —  £d-  We  approximated 
the  infinite  distributed  system  in  Figure  1  using  a  Mat  Lab 
SimuLink  model  consisting  of  27  nodes.  All  of  the  simu¬ 
lations  assumed  free  boundary  conditions.  The  results  for 
these  simulations  are  plotted  as  *  and  o.  We  simulated  3 
different  runs  at  6  different  dropout  rates.  In  the  simula¬ 
tions,  we  estimated  the  power  by  time  average  taken  over 
100,000  iterations.  The  theoretical  predictions  show  that 
the  optimal  policy  is  indeed  better  than  the  i.i.d.  policy. 
In  reviewing  the  transition  probabilities  given  above,  it  is 
apparent  that  the  optimal  policy  is  a  soft  (2,  3)-policy  for 
£d  <  0.2237.  For  throughput  allocations  above  this  rate, 
however,  the  structure  of  the  optimal  policy  changes  to  al¬ 
low  higher  dropout  rates.  The  simulation  results  show  close 
agreement  with  the  theoretical  predictions  for  a  wide  range 
of  dropout  rates. 

The  Markov  chains  derived  in  this  section  form  the  ba¬ 
sis  for  an  overload  management  policy  that  is  easily  imple¬ 
mented  on  an  embedded  processor.  In  particular,  a  number 
of  these  ’’optimal”  Markov  chains  would  be  determined  for  a 
range  of  overload  conditions  (e^)  and  a  range  of  link  failure 
rates  (/).  We  store  these  transition  probabilities  in  a  table 
that  is  indexed  with  respect  to  £d  and  /.  The  overload  man¬ 
agement  policy  used  by  each  node  is  a  concrete  instantiation 
of  the  optimal  Markov  chain  whose  transition  probabilities 
are  loaded  from  this  table  based  on  1)  the  throughput  (e^) 
that  was  allocated  to  the  node  and  2)  based  on  the  link  fail¬ 
ure  rate  (/)  that  was  estimated  by  the  node.  What  should  be 
apparent  is  that  the  resulting  policy  is  adaptive  with  respect 
to  link  failure  rate  and  throughput  allocation.  Moreover, 
since  these  chains  are  solutions  to  the  optimization  problem 
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cost  vs  dropout  rate  for  different  dropout  policies 


Figure  4:  Simulation  versus  Theoretical  Results 


in  equation  9,  we  know  that  this  policy  degrades  application 
performance  as  little  as  possible.  In  other  words,  we  have 
provable  guarantees  that  this  approach  makes  optimum  use 
of  the  throughput  allocated  to  the  local  controller.  The 
simulation  and  theoretical  results  shown  in  Figure  4  suggest 
that  hardware  implementations  of  such  policies  should  also 
perform  well.  This  hardware  testing,  however,  remains  to 
be  done. 

6.  FINAL  REMARKS 

One  approach  to  sensor- actuator  network  middleware  ad¬ 
vocates  the  development  of  software  that  is  application  in¬ 
dependent  This  approach  is  meaningful  for  networks  that 
must  use  a  wide  variety  of  traffic  types  in  diverse  and  un¬ 
predictable  situations.  By  their  very  nature,  however,  em¬ 
bedded  sensor  actuator  networks  are  dedicated  to  a  specific 
type  of  application  whose  traffic  patterns  are  often  highly 
regular  and  predictable.  By  adopting  middleware  architec¬ 
tures  that  ignore  our  prior  knowledge  of  the  plant,  we  are 
doomed  to  under- utilize  network  resources  in  support  of  that 
application. 

This  paper  demonstrated  another  approach  to  network 
middleware  in  which  application  specific  knowledge  can  be 
used  to  optimize  middleware’s  support  for  the  application. 
In  particular  we  were  able  to  use  a  priori  knowledge  of  the 
application  (in  our  case  a  spatially  distributed  control  sys¬ 
tem)  to  develop  overload  management  policies  that  provide 
provable  levels  of  application  performance  subject  to  con¬ 
straints  on  the  network  QoS  (as  measured  by  packet  dropout 
rates).  The  policy  developed  here  is  based  on  a  Markov  de¬ 
cision  process  that  emerges  from  solving  an  optimization 
problem  in  which  a  priori  knowledge  of  the  application’s  dy¬ 
namics  was  incorporated.  The  resulting  policy,  we  feel,  can 
be  readily  realized  as  a  minor  variation  on  existing  overload 
management  schemes,  the  only  real  change  being  that  our 
decision  probabilities  are  based  on  formal  understanding  of 
system  dynamics,  rather  than  ad  hoc  reasoning  about  the 
plant. 

The  preceding  discussion  focused  on  sensor- actuator  net¬ 
works  in  the  control  of  spatially  distributed  systems.  This 
work,  however,  is  also  directly  relevant  to  more  traditional 
sensor  networks.  In  particular,  it  is  possible  to  apply  these 


results  to  investigate  the  impact  that  dropped  sensor  mea¬ 
surements  would  have  on  the  performance  of  a  smoother 
prediction  algorithm  used  in  tracking  a  vehicle’s  trajectory 
through  the  sensor  field.  In  particular,  if  we  were  to  use 
a  Kalman  filter  to  smooth  sensor  measurements  in  a  way 
that  estimates  the  vehicle’s  trajectory,  then  a  natural  ques¬ 
tion  concerns  the  impact  that  missing  or  intermittent  sen¬ 
sor  data  has  on  the  Kalman  filter’s  performance  [11].  In 
this  case  the  control  loop  shown  in  Figure  2  still  applies. 
However,  the  controller/plant  combination  is  now  just  the 
equivalent  error  dynamics  for  the  Kalman  filter,  the  input 
disturbance  is  either  sensor  or  process  noise,  and  the  output, 
y:  becomes  the  state  estimation  error.  This  paper’s  results, 
therefore,  can  also  be  used  to  characterize  the  increase  in 
mean  state-estimation  error  as  a  function  of  average  sensor 
measurement  dropout  rate.  So  while  the  paper’s  scope  origi¬ 
nally  rested  in  distributed  control,  the  results  are  relevant  to 
pure  sensor  networks  relying  on  optimal  mean  square  error 
predictors  to  track  changes  in  a  sensor  field. 

While  this  paper’s  computational  results  have  restricted 
their  attention  to  spatially  invariant  and  1-d  spatial  sys¬ 
tems,  the  analysis  could  also  be  extended  to  2-d  and  spatially 
varying  systems.  For  example,  the  orignal  system  equations 
describe  a  2-d  nearest  neighbor  system  if  we  introduce  an 
additional  spatial  variable  77,3  and  rewrite  equation  1  as 

Xn\  +I,n2  ,713  —  ,77,3 

3 n2  ~  ljn3  "F  ^711,712  +  1,^13) 

-\-B(xn  1  ,712,^3  —  1  +  Xn1,n2,n3  +  l) 

+  F(uni,n2,n3  T  I+77 1 , 77.2  )n3  ) 

The  analysis  inherent  in  theorem  4.2  assumed  spatial  invari¬ 
ance.  While  this  assumption  simplifies  the  theorem’s  proof, 
a  close  examination  of  the  proof  reveals  that  the  assumption 
is  not  necessary  for  characterizing  the  covariance  matrices. 
Extending  the  results  in  this  paper  to  a  spatially  varying 
system  is,  of  course,  computationally  intensive,  but  concep¬ 
tually  the  extension  seems  quite  straightforward. 

To  the  best  of  our  knowledge,  this  paper  provides  one  of 
the  few  positive  examples  in  which  network  QoS  was  related 
to  control  system  performance  in  a  way  that  directly  leads 
to  practical  network  protocols.  Future  work  will  continue 
in  this  direction,  by  attempting  to  use  formal  systems  sci¬ 
ence  for  the  systematic  development  of  embedded  network 
middleware.  Over  the  immediate  future  we  intend  to  relax 
some  of  the  assumptions  inherent  in  this  paper’s  develop¬ 
ment,  primarily  the  reliance  on  spatial  shift- invariance  and 
1-dimensional  structures.  Over  the  long  term,  we  intend  to 
validate  the  ideas  developed  in  this  paper  on  hardware-in- 
the-loop  experiments. 
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7.  PROOFS 

Proof  of  Theorem  4.1:  Let’s  define  the  matrix 


So  we  can  use  this  to  write  out  for  k  >  2, 


II 

•sT  ^ 

'Ki'KjFi{Xni+l,n2Xni  +  i,n2_k  \  qni,n2  q i 

qni  ,n2  —k  —  qj  } 

P k  Fi\Xm  ,n2  Xni  n2  —  k  } 

= 

TViTYj  Ai'E{xnijn2Xnijn2_k  |  <?ni,n2  —  qi i 

qni  ,n2—k  —  qj}Aj 

+ 

TViTYj  Ai(E{xnijn2Xni^n2_k-l  \  qni  ,n2  —  < 

Because  all  nodes  are  identical,  Pk  has  no  dependence  on 

qni,n2—k  —  qj} 

either  m  or  n2.  We  also  know  that 

-\-E{Xn1,n2Xriliri2-k  +  l  |  qni,n2  —  qi, 

Qni,n2-k  —  qj})Pj 

+ 

TYiTYjBi(TL{xni,n2-iXni,n2_k  |  ?ni,n2  =  1 

Xni~\-l,n2  —  AXni  )n2 

qni  ,n2—k  —  qj} 

B  (xni  jTi2  —  l  T  ^ni,ri2  +  l) 

+E{xni,n2  +  iXnijn2_/(.  |  qni,n2  —  qi , 

T  F  wni  jU2 

Qni,n2—k  — 

Xni  +  l,n2  —  k  —  Axni^n2—k 

+ 

7Ti7Tj  Hi  (E{xni,n2_iXni5n2  _/;._!  |  qni,n2  : 

T  B(xrtl  ^n2  —  k  —  l  T  Xnijn2—  fc  +  l) 

qni,n2—k  —  tb} 

T  F  wni  ^n2 —k 

TE{xni)n2  — lXni;n2_fc  +  1  |  qni,n2  —  qi, 
qni  ,n2  —k  —  qj} 

+  'E{Xni,n2  +  lXnijn2_k-l  |  qni,n2  —  qi , 

We  can  now  expand  out  Pk  as 

qni,n2  —  k  —  qj} 

+  E{xni,n2  +  iXnijn2_/c  +  1  |  qUi  ,n2  —  qii 

Qni,n2-k  — 

Pk  Fj\xUi  -\-l,n2  +  l,n.2  —  k  } 

—  y4(pfe+1  -f  Pfc_1)PT  +  P(Pfc+1  +  Pfc_i)y4T  There  are  nine  conditional  expectations  in  the  above  equa- 

—  T  —  —  —  T  tion.  The  first  expectation  can  be  simplified  as  follows, 

-\-APkA  +  B\Pk  +  Pk-2  +  Pk+2)B  +  Rk 


where  Rk  =  0  if  k  ^  0  and  Ro  =  R  (  R  is  the  covariance 
of  the  noise  process  w).  The  above  equation  is  the  third 
equation  in  the  theorem  (k  >  2).  We  use  a  similar  procedure 
to  get  the  other  two  equations  in  the  theorem.  • 

Proof  of  Theorem  4.2:  Note  that  if  qni,n2  —  Qi  and 
q<n\  ,n2  —k  —  qj,  then  the  states  at  time  m  +  1  can  be  written 
as 


%ri\  +1  ,n2 


%ni  -\-l,n2  —k 


AiXn i  ,n2 

+Bi(  ,n2  —  1  T  %n\  ,n2-\-l) 

T F UJni  ,n2 

AjXni  ^2 —k 

pPj(xni,n2-k-l  T  fc  +  l) 

T  F  wni  jU2_k 


(10) 


TViTYj  F‘{xni,n2Xnijri2_k\qn1,n2  —  q% , 

qni  ,n2  —  k  —  qj  } 

N 

—  'y  ^  iriirj'E{xni,n2xnijn2_k  \  qni ,n2  —  qi, 

l,m=l 

qn1,n2  —  k  —  qj  ,  qn1—l,n2  —  qi ,  <Pi -l,n2  —  k  —  <?m} 
P(Qni  —  l,n2  —  qi  •>  qn\  —  l,n2—k  —  qm  \ 

qn\,n2  —  qi-,  qni  ,n2  —  k  —  qj) 

N 

—  ^  ^  7T?;7Tj  E{x'ni  ;ri2  Xni  ;n2  —  fc  |  <pi  —  l,n2  —  qii 
l,m= 1 

qni~l,n2—k  —  qm\P  (qni  —  l,n2  —  qi  \  qni  ,n2  —  qi)  ' 

P (qni  —  l,n2—k  —  qm  \  qni  ,n2  —  k  qj) 

N 

5 l,m  qmj  qii 


EE** 


-  TTiTTj  7  7  rfe 

1=1  l,m 
N  N 


TViTVj 

=  EE  qii  qmj  P  l, m  (P) 


Here,  when  qUi,n2  —  qi,  we  denote  A(qnijTl2)  and  B(qnijn2)  The  second  expectation  can  be  simplified  as  shown  below, 

as  Ai  and  B{  respectively.  The  third,  fourth,  and  seventh  expectations  have  similar 
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derivations  and  aren’t  shown. 

TTiTTj  ^{Xnljn2Xn1,n2-k-l  \  Qni  ,n2  —  Qi-> 

Qni  ,n2  —k  —  Qj  } 

—  7T?;7TjE{a:ni  ;ri2  xni  ,n2  —  fc  — 1  \  Qni  ,n2  —  Q%  } 

N 

—  TTiTTj  ^  ^  E{xni  ;n2  ^ni  ,n2  —  fc  — 1  |  Qni,n2  ~  Qi  ? 
i,m=l 

Qni  —  1  ,n2  =  Ql  t  Qni  —  l,n2  —  k  —  l  — 

P(Qni  —  l,n2  =  Ql  i  Qni  —  l,n2  —  k  —  1  =  Qm  \  Qni  ,n2  — 

N 

—  TTiTTj  y  ^  E{xni  ;n2  ^ni  ,n2  —  k  —  l  \  Qni~l,n2  —  Ql , 
i,m=l 

Qni  —l,n2—k  —  l  — 

P^Qni  —l,n2  —  Ql  I  Qni,n2  — 

P(,Qni  —  l5ri2  —  A;  —  1  Qm  ) 

N 

E~j^l,m 

Qli^k+l 

l,m=  1 

The  fifth  expectation  can  be  simplified  as  shown  below: 
The  sixth,  eight,  and  ninth  expectations  have  similar  deriv¬ 
ations  and  aren’t  shown, 

'Ki'Kj  ^{Xni,n2  —  lXnijn2  —  k—l\Qni,n2  —  Qi-, 

Qni  ,n2  —  k  —  Qj  } 

7ri7rjE{xni5n2-ixnijn2-A;— l} 

N 

Pk 

l,m=  1 

With  these  simplifications  inserted  into  equation  10,  we 
obtain  the  third  equation  in  theorem  4.2.  A  similar  deriva¬ 
tion  can  be  used  to  obtain  the  first  two  equations.  • 
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Abstract 

This  article  addresses  the  routing  problem  for  large  wireless  networks  of  randomly 
distributed  nodes  with  Rayleigh  fading  channels.  First,  we  establish  that  the  distances 
between  neighboring  nodes  in  a  Poisson  point  process  follow  a  generalized  Rayleigh 
distribution.  Based  on  this  result  it  is  then  shown  that,  given  an  end-to-end  packet 
delivery  probability  (as  a  QoS  requirement),  the  energy  benefits  of  routing  over  many 
short  hops  are  significantly  smaller  than  for  deterministic  network  models  that  are 
based  on  the  geometric  disk  abstraction.  If  the  permissible  delay  for  short-hop  routing 
and  long-hop  routing  is  the  same,  it  turns  out  that  routing  over  fewer  but  longer 
hops  may  even  outperform  nearest-neighbor  routing,  in  particular  for  high  end-to-end 
delivery  probabilities. 

Index  terms  -  Communication  systems,  routing,  fading  channels,  ad  hoc  networks,  proba¬ 
bility,  Poisson  processes. 

1  Introduction 

Energy  consumption  in  multi-hop  wireless  networks  is  a  crucial  issue  that  needs  to  be  ad¬ 
dressed  at  all  the  layers  of  the  communication  system,  from  the  hardware  up  to  the  applica¬ 
tion.  In  this  article,  we  focus  on  routing  strategies  that  employ  hops  of  different  length  in  a 
network  whose  nodes  constitute  a  Poisson  point  process.  The  analysis  is  based  on  a  Rayleigh 
fading  channel  model,  and  the  results  demonstrate  that  the  properties  of  the  physical  channel 
have  a  substantial  impact  on  optimum  protocol  design  at  the  network  layer. 

Often,  a  deterministic  “disk  model”  is  used  for  the  analysis  of  multi-hop  packet  networks 
[1—9] ,  where  the  radius  for  a  successful  transmission  has  a  deterministic  value,  irrespective  of 
the  condition  of  the  wireless  channel.  Interference  is  taken  account  using  the  same  geometric 
disk  abstraction.  The  stochastic  nature  of  the  fading  channel  and  thus  the  fact  that  the 
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signal-to-noise-and-interference  ratio  (SINR)  is  a  random  variable  are  neglected.  Using  such 
models,  it  is  easy  to  show  that,  for  a  path  loss  exponent  of  a,  there  is  an  energy  gain  of  na~l 
if  a  hop  over  a  distance  d  is  split  into  n  hops  of  distance  d/n.  However,  the  volatility  of  the 
channel  cannot  be  ignored  in  wireless  networks  [10,11];  the  inaccuracy  of  “disk  models”  has 
also  been  pointed  out  in  [12]  and  is  easily  demonstrated  experimentally  [13,14],  In  addition 
the  “prevalent  all-or-nothing  model”  [15]  leads  to  the  assumption  that  a  transmission  over 
a  multi-hop  path  either  fails  completely  or  is  100%  successful,  ignoring  the  fact  that  end-to- 
end  packet  loss  probabilities  increase  with  the  number  of  hops  (unless  the  transmit  power  is 
adapted) . 

To  overcome  some  of  these  limitations  of  the  “disk  model” ,  we  employ  a  simple  Rayleigh 
fading  link  model  that  relates  transmit  power,  large-scale  path  loss,  and  the  success  of 
a  transmission.  The  end-to-end  packet  delivery  probability  over  a  multi-hop  route  is  the 
product  of  the  link- level  reception  probabilities. 

While  fading  has  been  considered  in  the  context  of  packet  networks  [16, 17],  its  impact 
on  the  network  (and  higher)  layers  is  largely  an  open  problem.  This  paper  attempts  to 
shed  some  light  on  this  cross-layer  issue  by  analyzing  the  performance  of  different  routing 
schemes  under  Rayleigh  fading.  In  Section  2,  we  introduce  the  link  model  and  show  that 
noise  and  interference  can  be  treated  separately.  We  also  prove  that  the  internode  distances 
in  a  Poisson  point  process  follow  a  generalized  Rayleigh  distribution;  we  define  and  determine 
the  path  efficiency;  and  we  introduce  the  five  routing  strategies  we  consider.  In  Section  3, 
the  energy  consumption  of  those  five  routing  strategies  is  analyzed;  Section  4  discusses  the 
performance  of  those  routing  schemes  under  equal  delay  constraints,  and  Section  5  concludes 
the  paper. 

2  The  Rayleigh  Network  Model 

2.1  The  Rayleigh  fading  link  model 

We  assume  a  narrowband  Rayleigh  block  fading  channel.  A  transmission  from  node  i  to 
node  j  is  successful  if  the  SINR  7^  is  above  a  certain  threshold  ©  that  is  determined  by 
the  communication  hardware,  and  the  modulation  and  coding  scheme  [10].  The  SINR  7  is 
a  discrete  random  process  given  by 

R 

7  ~~  N  +  r 

R  is  the  received  power,  which  is  exponentially  distributed  with  mean  R.  Over  a  transmis- 
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sion  of  distance  d  =  || ay  —  x3 1|2  with  an  attenuation  da,  we  have  R  =  Pod~a,  where  P0  is 
proportional  to  the  transmit  power1.  N  denotes  the  noise  power,  and  I  is  the  interference 
power  affecting  the  transmission,  he.,  the  sum  of  the  received  power  from  all  the  undesired 
transmitters. 


Theorem  1  In  a  Rayleigh  fading  network,  the  reception  probability  pr  :=  P[y  ^  0]  can  be 
factorized  into  the  reception  probability  of  a  zero-noise  network  and  the  reception  probability 
of  a  zero-interference  network. 

Proof:  Let  Bo  denote  the  received  power  from  the  desired  source  and  B, ,  i  =  1 , ,k, 

the  received  power  from  k  interferers.  All  the  received  powers  are  exponentially  distributed, 
i.e.,  PR^rf)  =  1  / R,  crri/Ri- ,  where  R,  denotes  the  average  received  power  R,  =  Pid~a.  The 
probability  of  correct  reception  is2 


Pr  =P[i?0  >0(1  +  N)] 

=E,  [  exp  (  - 

V  Kq 


e(Eti  n  +  aQ\ 
R0  > 


k 

it*  ^rf)  dri  •  •  •  drfc 

2=1 


(2) 

(3) 

(4) 

(5) 


Pr  is  the  probability  that  the  SNR  y/y  :=  Rq/N  is  above  the  threshold  0,  i.e.,  the  reception 
probability  in  a  zero-interference  network  as  it  depends  only  on  the  noise.  The  second  factor 
Pr  is  the  reception  probability  in  a  zero-noise  network.  □ 

This  allows  an  independent  analysis  of  noise  and  interference.  If  the  load  is  light,  then  the 
number  of  interferers  k  is  relatively  small  and/or  their  distances  are  relatively  big,  which 
implies  SHTtSNR,  thus  the  noise  analysis  alone  provides  accurate  results.  This  has  been 
demonstrated  in  [18]  in  the  case  of  slotted  ALOHA  with  transmit  probability  p.  Indeed, 
for  small  p,  the  throughput  is  proportional  to  p,  indicating  that  there  are  no  losses  due 
to  collisions,  i.e.,  pi  ~  1  for  all  timeslots  and  all  transmissions.  For  high  load,  a  separate 

1This  equation  does  not  hold  for  very  small  distances.  So,  a  more  accurate  model  would  be  R  =  Pq  ■ 
(d/do)~a,  valid  for  d  ^  do,  with  P'{)  as  the  average  value  at  the  reference  point  do,  which  should  be  in  the 
far  field  of  the  transmit  antenna.  At  916MHz,  for  example,  the  near  field  may  extend  up  to  lm  (several 
wavelengths) . 

2  A  similar  calculation  has  been  carried  out  in  the  Appendix  of  [16]  for  a  network  with  spreading  gain  and 
equal  transmit  powers  for  all  nodes. 
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interference  analysis  has  to  be  carried  out  to  determine  .  Note  that  power  scaling,  i.e., 
scaling  the  transmit  powers  of  all  the  nodes  by  the  same  factor,  does  not  change  the  SIR  (pi 
only  depends  on  power  ratios),  but  (slightly)  increases  the  SINR.  Since  only  p ^  is  affected  by 
absolute  energy  levels,  we  will  focus  on  zero-interference  networks  to  identify  energy-efficient 
routing  strategies. 

In  a  zero-interference  network,  the  reception  probability  over  a  link  of  distance  d  at  a 

QN 

transmit  power  P0,  is  given  by  pr  :=  P[7./v  ^  0]  =  e  pod~a  ,  therefore 


Po  = 


da@N 
—  In  pr 


(6) 


Note  that  for  high  probabilities,  the  packet  loss  probability  1  —  pr  is  tightly  upperbounded 
by  the  normalized  mean  NSR  QN/R0=Q/ yjv  [19].  Since  —  In pr  ~  1  —  pr,  we  can  also  say 
that  the  packet  loss  probability  is  inversely  proportional  to  the  transmit  power  for  high  pr. 


2.2  Poisson  random  networks 

We  consider  a  Poisson  point  process  of  intensity  A  in  the  plane,  where  the  probability  of 
finding  k  nodes  in  an  area  A  is  given  by  the  Poisson  distribution3 

P[fc  nodes  in  A]  =  e~XA  — .  (7) 

rC ! 

Note  that  for  infinite  networks,  the  Poisson  point  process  corresponds  to  a  uniform  distri¬ 
bution  [12,20],  and  for  large  networks,  the  two  distributions  are  equivalent  for  all  practical 
purposes.  Henceforth  we  denote  a  network  whose  nodes  constitute  a  two-dimensional  Pois¬ 
son  point  process  as  a  Poisson  random  network.  Without  loss  of  generality,  we  will  restrict 
ourselves  to  the  case  A  =  1  (unit  density),  since  the  product  A  A  can  always  be  scaled  such 
that  A  =  1. 

For  the  routing  schemes  we  consider,  we  need  to  determine  the  distance  from  one  node 
to  its  neighboring  nodes  that  fie  within  a  sector  0,  i.e.,  within  ±0/2  of  the  source-destination 
axis  (Fig.  1). 

Proposition  1  (Distance  to  nearest  neighbor.)  In  a  Poisson  random  network  with  unit 
density,  the  distance  R  between  a  node  and  its  nearest  neighbor  in  a  sector  0  is  Rayleigh  dis¬ 
tributed  with  mean  y/7r/(20). 

3This  can  be  generalized  to  higher  dimensions  if  A  is  the  Lebesgue  measure  of  the  subset  considered. 
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Figure  1:  Part  of  a  Rayleigh  network  with  the  source  at  the  origin  and  the  a:- axis  pointing 
towards  the  destination  node.  R  denotes  the  distance  to  the  nearest  neighbor  within  a  sector 
0  around  x.  and  'ip  is  its  argument.  Hence  (R,ip)  are  the  polar  coordinates  of  the  nearest 
neighbor  within  a  sector  0,  and  ( X ,  Y)  are  its  Cartesian  coordinates. 

Proof:  Let  R  be  the  distance  to  the  nearest  neighbor  in  a  sector  0.  The  probability 

that  there  is  no  neighbor  in  a  sector  0  up  to  a  distance  r  is  the  complementary  cumulative 
distribution  P [R  >  r]  =  e-7,2^2,  thus  the  probability  density  is  Pr(t)  =  r0e-r 2<^/2,  which  is 
a  Rayleigh  distribution  with  mean  \f^f\ 20)  and  variance  2/0  —  7t/(20)  =  (4  — 7r)/(20).  The 
distribution  of  the  argument  xp  is  uniform  between  —0/2  and  0/2.  □ 

Definition  1  (Rayleigh  network.)  A  Rayleigh  network  is  a  Poisson  random  network 
where  the  physical  channel  is  subject  to  Rayleigh  fading. 

To  compare  different  routing  schemes,  we  need  to  define  the  path  efficiency. 


Definition  2  (Path  efficiency.)  The  path  efficiency  is  the  ratio  of  the  Euclidean  distance 
between  the  end  nodes  and  the  actually  travelled  distance. 

As  an  example,  the  path  efficiency  (for  long  connections4)  in  a  square  lattice  network 
with  nearest-neighbor  routing  is 

«(0)  =  7 -  .  1  7  ■ - -7 

|  cos  0|  +  |  sin  0| 

4  This  is  based  on  the  assumption  that  the  angle  towards  the  destination  is  uniformly  distributed  over 
[0,2tt). 
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where  0  denotes  the  direction  from  transmitter  to  receiver  relative  to  the  orientation  of  the 
grid.  The  maximum  is  1,  the  minimum  is  l/\/2  (at  0  =  x  /  I  +  kx /2).  and  the  expected  value 
is 

2\/2  \[2 

r]  :=  E[k]  = - arctanh  —  fa  0.79  .  (9) 

7T  2 

We  will  use  y  to  denote  the  expected  path  efficiency. 

Definition  3  (Link  efficiency.)  The  link  efficiency  is  X/R  =  cos(\I/),  where  X  is  the  x- 
coordinate  of  the  position  of  the  nearest  neighbor  (see  Fig.  1). 

In  Rayleigh  networks  with  long  connections,  the  distances  and  angles  to  nearest  neighbors 
are  iid,  so  the  expected  path  efficiency  equals  the  expected  link  efficiency: 


Proposition  2  (Path  efficiency  in  Rayleigh  networks.)  In  a  Rayleigh  network  with 
nearest-neighbor  routing  in  a  sector  0,  the  expected  path  efficiency  for  a  long  connection  is 


v(4>)  = 


sin 


1  - 


24  ’ 


(10) 


where  the  approximation  is  the  second-order  Taylor  expansion. 

Proof:  The  expected  link  efficiency  is  E[cos\k],  and  since  the  hop  distances  are  iid,  this  is 
also  the  expected  path  efficiency.  Thus  we  have 


7?(0)  =  E[cos  T] 


cos  ip  dip  =  —  sin 
<P 


(11) 


□ 

To  compare  the  transport  capacity  of  different  routing  schemes,  we  have  to  determine  the 
progress  X.  Changing  to  Cartesian  coordinates,  the  joint  probability  pxy(x,y)  for  the 
nearest  neighbor’s  position  (in  a  sector  0  ^  7r)  is 

x2+  2 

Pxy(x,y)  —  e  2^  ,  x  (0  0,  —  xtan(0/2)  ^  y  ^  a;tan(^>/2)  (12) 


Integration  with  respect  to  y  yields  the  marginal  density 

Px(x)  =  erf  tan  \f^fe~x2<l)/2  ’  (13) 

where  erf(-)  denotes  the  error  function,  i.e.,  erf(x)  :=  2/y/7r  ff  e~f2dt.  It  is  easily  verified 
that  for  small  0,  as  X  ~  R,  this  tends  to  a  Rayleigh  distribution.  On  the  other  hand,  for 
0  — *  7r,  tan(0/2)  — ►  oo,  and  the  error  function  tends  to  1,  and  the  distribution  approximates 
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the  one-sided  Gaussian  \f2e~x2'K^  ^  x  Y  0,  with  mean  \/2/tt.  Y  has  zero  mean  and  becomes 
Gaussian  for  f  — >  it  with  variance  l/'K.  Note  that  X  and  Y  are  not  independent  unless 
4>  =  7T  or  cj)  =  2n. 

The  expected  progress  E[X]  is 


E[X]  = 


(14) 


which  corresponds,  as  expected,  to  the  product  of  the  mean  path  efficiency  r)(<f>)  (10)  and 
the  mean  of  the  Rayleigh  distribution  \Jtx  f  {2(f)). 

Next,  we  generalize  Proposition  1. 


Proposition  3  (Distance  to  n-th  neighbor.)  The  probability  density  of  the  distance  to 
the  n-th  neighbor  in  a  sector  <f>  is 

2 


PRn(r)  =  r 


2n—l 


D-r2cf)/ 2 


(15) 


2)  (n  — 1)! 

Proof:  Let  Sk  be  the  k- th  coefficient  in  the  Poisson  distribution:  Sk  ■—  [r2<f)/2)k /k\.  The 
probability  that  there  are  less  than  n  nodes  closer  than  r  in  the  sector  <p  is 

71—1 

Pn  :=  P[0  . . .  n  —  1  nodes  within  r]  =  Sk  e_r2^2  .  (16) 


k= 0 


Hence, 


77—1 


77—1 


PH.  =  i  d  -  Pn)  =  ~  E  YPtlftl  r<p  )  e~rX,l 

^  k= 0  k= 1  ' 


(17) 


The  only  term  that  is  not  cancelled  in  the  two  sums  is  the  one  at  n  —  1,  leading  to 


PRn  =  rf  ■ 


Erlang  distribution 


(18) 


which  is  identical  to  (15).  □ 

Since  pnn  is  a  Rayleigh  distribution  for  n  =  1,  it  can  be  considered  a  generalized  Rayleigh 
distribution.  Similarly,  in  one  dimension,  the  Erlang  distribution  is  a  generalized  exponential 
distribution.  So,  the  transition  from  one  dimension  to  two  dimensions  simply  entails  a 
multiplication  by  r<f>  (that  comes  from  the  inner  derivative  of  the  exponential  part)  in  the 
distributions  of  the  node  distances. 

The  mean  of  Rn  is  given  by 


E[Rn] 


V2TJn+m~V2r — -y 

r(n)  ~^v  1+4' 


(19) 
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To  get  this  approximation,  we  take  the  first  term  from  the  series  expansion  r(-p|J-j/2-)  = 
y/n(l  —  +  0(^2))  [21]5  and,  noting  that  this  is  not  accurate  for  small  n,  adjust  it  (by 

adding  7r/4  —  1  to  n)  such  that  the  approximation  is  precise  for  n  =  1.  For  n  >  1,  this  yields 
a  very  tight  upper  bound. 

The  second  moment  is  2 n/0,  hence  the  variance  is 

0 79  A  —  7 t 

Var [Rn\  =  —  -  E[Rnf  «  —  ,  (20) 

using  the  same  approximation.  From  the  series  expansion  above,  we  know  that  Var [/?,,]  = 
^  +  O(P).  So  we  have  found  sharp  lower  and  upper  bounds  on  the  variance: 

<  VarK]  <4  VnG  N  (21) 

20  20 

Since  4  —  7r  ~  1,  we  can  conclude  that,  interestingly,  the  variance  is  almost  independent  of 


2.3  Energy  consumption  of  a  route 

Assume  a  connection  from  node  0  to  node  n  in  a  multi-hop  network.  The  desired  end-to-end 
reliability  is  Pee-  The  reception  probability  of  a  chain  of  n  nodes  is 


pn  =  e  0,/7<  =  e  e^'-1  7 i 
1=1 


(22) 


where  %  denotes  the  mean  SNR  at  link  i. 

Assume  that  the  maximum  transmit  power  Prnax  is  sufficient  to  reach  node  n  in  a  single 
hop  with  probability  of  Pee-  The  question  is  how  many  hops  are  optimum  in  terms  of 
energy  consumption.  First,  we  consider  the  case  of  a  one-dimensional  chain  of  equidistant 
nodes  with  distance  d.  Let  E0  be  the  energy  required  for  a  transmission  over  distance  d  with 
probability  Pee,  be.,  E0  :=  —daQN/  In  Pee-  Covering  the  total  distance  in  one  single  hop 
requires  an  energy  of  Ei  =  naE0.  In  the  multi- hop  case  with  n  hops,  a  reception  probability 
Pr  =  \J Pee  is  required  at  each  hop.  Since  In  Pee  =  nhipr,  the  total  energy  in  this  case  is 
En  =  n  ■  nE0.  So,  for  a  =  2,  there  is  no  benefit  in  multi-hop  routing. 

5Note  that  this  series  can  also  be  derived  by  using  the  identity  =  P-J  ail(;l  applying 

Stirling’s  approximation. 

6This  is  not  the  case  for  one-dimensional  networks,  where  the  variance  increases  linearly  in  n  (variance 
of  the  Erlang  distribution  with  parameter  n). 
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For  a  regular  rectangular  lattice  network  where  the  maximum-hop  case  corresponds  to 
routing  over  nearest  neighbors,  we  still  have  En  =  n2 E(} ,  whereas  for  the  single-hop  case,  we 
get  Ei  =  ( nn)aE0 ,  where  k  is  the  path  efficiency.  The  energy  consumption  is  identical  for 


2  logn 
log  (nn) 


(23) 


For  a  =  4  and  k  =  1  / \/2,  we  get  E\  =  E2,  so  even  for  large  path  loss  exponents,  there  is  no 
benefit  in  dividing  a  one-hop  diagonal  connection  into  two  nearest-neighbor  hops. 


2.4  Routing  strategies 

Here,  we  introduce  the  five  routing  strategies  that  will  be  analyzed  in  the  next  Section. 
In  order  to  implement  those  routing  algorithms,  it  is  assumed  that  all  nodes  know  their 
own  location  and  that  the  source  node  knows  the  direction  towards  the  destination.  The 
strategies  are  the  following  (see  Fig.  2): 

A.  Multi- hop:  Transmit  hop  by  hop  over  n  nearest- neighbor  hops  in  a  sector  cf>,  i.e., 
the  next-hop  node  is  the  nearest  neighbor  that  lies  within  ±</>/2  of  the  axis  to  the 
destination. 

B.  Single-hop:  Transmit  directly  to  the  n-th  node  in  the  route  found  in  A. 

C.  Single-hop:  Transmit  directly  to  the  n'-th  nearest  neighbor  in  the  sector  0. 

D.  Single-hop:  Transmit  directly  to  the  n-th  nearest  neighbor  in  a  sector  q !>'  <  0. 

E.  Single-hop:  Transmit  directly  to  the  nearest  neighbor  in  a  sector  <fi"  <  ([)' . 

Note  that  “single-hop”  here  does  not  mean  that  the  (final)  destination  is  reached  in  one 
hop,  but  it  means  that  a  certain  progress  X0  is  made  in  one  hop  rather  than  in  n  hops.  So, 
relative  to  the  end-to-end  connection,  all  the  schemes  are  multi-hop,  but  A  uses  short  hops, 
whereas  B  through  E  use  long(er)  hops.  For  a  fair  comparison,  we  shall  choose  n',  0',  and 
<fi"  for  strategies  C,  D  and  E,  respectively,  such  that  the  expected  progress  X  is  the  same  as 
for  A  and  B.  As  long  as  routing  decisions  are  taken  locally,  virtually  all  routing  strategies 
fall  into  those  categories;  whatever  the  specific  properties  of  an  algorithm  are,  the  goal  will 
always  be  to  make  some  progress  in  the  right  direction. 
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Figure  2:  Illustration  of  the  different  routing  strategies.  Strategic  A  follows  the  solid  line 
(multiple  hops),  B  and  D  follow  the  dashed  line,  C  follows  the  dotted  line,  and  E  follows  the 
dash-dotted  line.  The  sectors  0,  0',  and  0"  are  represented  by  different  grades  of  gray. 

3  Energy  Consumption 

In  this  Section,  the  energies  required  to  deliver  a  packet  over  a  progress  X  :=  nr;(0)E[i?(0)]  = 
nrj(<f)  \Jix /(2(p)  with  probability  Pee  are  determined  for  those  five  strategies,  normalized  by 

GN 

—  In  Pee  * 


3.1  Analysis  of  the  five  routing  strategies 

Strategy  A.  With  pr  =  Pee,  the  total  expected  energy  consumption  is  rr £[/?."] : 

EA=ni{lTr(i*i)  m 

" - V - ' 

Strategy  B.  First,  we  need  to  establish  the  path  efficiency.  In  strategy  A,  the  angle  to  the 

next  node  is  uniformly  distributed  in  ±0/2.  With  increasing  n,  the  argument  T  from  the 

origin  to  the  n-th  node  tends  to  be  Gaussian  distributed  with  variance  V(d,  n).  Since  the 

support  of  the  pdf  of  T  is  always  [—0/2,  0/2]7,  the  variance  decreases  inversely  proportional 

7Hence,  after  every  convolution,  the  support  needs  to  be  scaled,  which  results  in  a  reduction  of  the 
variance. 
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(25) 

(26) 


to  n,  i.e.,  V (<p,  n) 


Ah 

12  n' 


So,  for  large  n,  we  get8 


n)  =  E[cos  T] 
VQn  f° 

c 


cos  xp  e  6n(^)  d ip 


41 

=  g  24  n 


£_ 

24 n  ’ 


(27) 

(28) 


from  which  we  find  1  /t]b(ti,  (p)a  ~  1  +  Clearly,  rjsin,  <p)  ^  vifi)  with  equality  only  for 
n  —  1.  Since  we  are  transmitting  over  a  distance  riR-^,  we  get 

=  »■*[*■] (£)*  »  «■  (|)”/2  r  (l  +  f )  (l  -  (29) 

So,  the  increase  of  na~2  compared  with  strategy  A  is  partially  compensated  for  by  the  factor 
( v/Vb)™ ,  he.,  the  increase  in  path  efficiency. 


Strategy  C.  A  direct  transmission  to  the  n'-th  neighbor  in  the  sector  (p  shall  result  in  the 
same  expected  progress  as  in  strategy  A,  i.e.,  E[i?n/]  =  nE[i?].  From  (19),  we  have 


V2 


(30) 


which  yields 

n'  =  \(n2-  1)  +  1,  (31) 

independent  of  <p .  The  path  efficiency  for  this  single- hop  transmission  is  still  rj((P),  since  the 
argument  of  the  destination  node  is  uniformly  distributed  in  the  sector  cp.  We  have9 


ER'l 


r  (n'  +  f ) 
r(n') 


Plugging  in  (31)  gives 


(32) 


(33) 


8 The  Gaussian  approximation  is  very  accurate  even  for  small  n.  For  (/)  =  tt/ 2  and  n  =  1,  e.g .,  the  precise 
value  is  2\f2/Tt  «  0.9003,  whereas  this  approximation  yields  e_7r  Z96  «  0.9023,  so  the  error  is  only  0.2%.  The 
second  order  Taylor  expansions  are  identical,  even  for  n  =  1. 

9 Again,  we  use  the  series  expansion  ^  =  nr^ (\  —  O(-T))  and  adjust  the  constant  such  that  the 

approximation  is  precise  for  a  =  n'  =  1.  Note  that  for  a  =  1,  this  is  identical  to  (19).  For  a  =  2,  E[i2“,]  oc  n', 
and  for  a  =  4,  E [R%,]  oc  n'2 .  The  absolute  error  in  the  approximation  for  a  =  2  is  ttv^/4  —  1  ~  0.11, 
independent  of  nf . 
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Strategy  D.  The  reduction  of  the  sector  angle  from  0  to  0'  shall  ensure  that  the  transmis¬ 
sion  to  the  77-th  nearest  neighbor  within  0'  results  in  the  same  progress  as  n  nearest-neighbor 
hops  in  0,  i.e., 

nE[i?(0)]  •  77(0)  =  E K(0')]  •  77(0') .  (34) 


Since  R  and  Rn  depend  on  0  and  0',  respectively,  0'  cannot  be  expressed  in  a  closed  form. 
However,  we  can  find  a  tight  upper  bound  on  the  energy  consumption  (lower  bound  on  0') 
by  assuming  that  the  path  efficiency  does  not  increase  with  decreasing  angle: 


4(77  —  1)  +  7T 
n2ir 


(35) 


0'  ~  0/n  with  high  accuracy.  With  (32),  we  get  the  upper  bound  BD 

E  ■  .  (  2 ™2  y/2r("+!> 

ED  E[iSJ  «  Bo  ^(4(n  _  1}  +  ■  (36) 

To  get  a  lower  bound,  we  assume  that  the  path  efficiency  for  0'  is  1.  Solving  nE[i2(0)]7y(0)  = 
E[i?n(0')]  using  (19)  yields 

^<^4(n_1)+7r  1  ( Or~7\ 

0  ~  0  77%  77(0)2  ’  (3?) 

and  we  find 

Bd'ZEdZ  BDr/((f))a .  (38) 


For  large  77,  Ed  tends  to  the  lower  bound.  Since  Bd  has  the  same  asymptotic  behavior  for 
large  n  as  strategy  C,  strategy  D  outperforms  C. 


Strategy  E.  Here  we  determine  an  even  smaller  angle  0"  such  that  the  first  neighbor 
within  that  sector  has  the  same  expected  distance  as  n  hops  in  the  original  sector  0.  As  in 
D,  the  equation  nE[i?(0)]7y(0)  =  E[i?(0")]7](0")  does  not  have  a  closed-form  solution,  and 
we  have  to  bound  Ee ■  From  7 ](0")  >  77 (0)  we  have  0"  ^  0/r72  and  thus 

Ee  <  Be  :=  77“  7  T  (l  +  |)  .  (39) 

On  the  other  hand,  assuming  77 (0")  =  1,  we  find  a  tight  (for  n  >  2)  lower  bound  BEr]((f))a , 
hence 

Be  >  Ee  ~  5e77(0)“  .  (40) 
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Table  1:  Normalized  energy  consumption  of  the  five  routing  strategies.  The  approximations 
are  second-order  Taylor  expansions. 

3.2  Comparison 

In  the  previous  Subsection,  the  energy  was  calculated  for  a  transmission  over  a  distance  of 
X  =  riE[R]r).  To  determine  the  energy  required  per  unit  distance ,  the  energies  are  divided 
by  Xa.  Table  1  compares  the  five  proposed  routing  strategies. 

Remarks: 

•  For  strategy  A,  the  energy  consumption  decreases  with  increasing  n  for  a  >  2.  If  we 
take  end-to-end  probabilities  and  path  efficiencies  into  account,  the  benefit  of  multi¬ 
hop  routing  under  Rayleigh  fading  is  substantially  smaller  than  under  the  deterministic 
“disk  model”:  for  a  =  2,  it  is  always  detrimental  to  use  short  hops,  and  for  a  =  3, 
0  =  7t/2  and  n  =  2,  we  have  Eb/Ea  ~  1.76,  so  skipping  one  hop  is  only  76%  more 
expensive,  rather  than  2Q_1  =  400%,  as  an  overly  simplistic  analysis  would  yield  (see 
the  remarks  in  Section  1). 

•  The  single-hop  strategies  B-E  all  have  comparable  performance.  With  increasing  n, 
their  energy  consumption  decreases  since  the  path  efficiency  increases. 

•  For  strategy  C,  and  a  =  3,  the  approximation  does  not  precisely  agree  with  A  for 
n  =  1  due  to  the  approximation  of  the  r(-)  function  for  half- integers. 

•  For  strategies  B,D,  and  E,  the  path  efficiencies  tend  to  1  with  increasing  n  at  different 
speeds. 

•  Strategy  D  is  the  only  one  where  the  energy  consumption  goes  to  1  with  increasing  n. 
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In  terms  of  the  complexity  of  the  implementation  of  those  schemes,  B  and  D  are  similar  to 
A,  while  E  is  simpler,  since  only  one  nearest  neighbor  has  to  be  identified.  C,  on  the  other 
hand,  is  impractical  for  large  n  due  to  the  increase  in  n 2  in  (31).  For  n  —  5,  for  example, 
the  20th  neighbor  would  need  to  be  identified,  which  may  cause  too  much  overhead. 


3.3  Determination  of  the  sector  angle  (f) 


From  the  energy  analysis,  it  can  be  seen  that  it  is  desirable  to  choose  ([>  as  small  as  possible 
to  achieve  a  high  path  efficiency.  On  the  other  hand,  we  have  to  make  sure  that  a  sufficient 
number  of  neighbors  can  be  reached  with  high  probability  within  the  sector.  The  maximum 
distance  R  that  can  be  covered  at  the  peak  transmit  power  Pmax  to  guarantee  a  reception 
probability  pr  is 

1  .1 


R  = 


InPr) 


7n 


-(— lnpr) 


ON  J  \  0 

If  pc  is  the  desired  probability  of  finding  a  neighbor  within  0,  we  get  from  Proposition  1 


(41) 


0  > 


—2  ln(l  -pc) 
R2 


—2  ln(l 


/  ON  \ « 

\  Pmax  In  Pr  / 


(42) 


For  0  =  7t/2,  for  example,  the  expected  distance  to  the  nearest  neighbor  is  1,  but  for 
pc  =  99.9%  (high  probability  of  connectivity),  a  range  R  =  -y/12  In  10/7T  ~  3  is  needed. 


4  Delay  Considerations 

Clearly,  if  the  delay  constraints  are  tight,  routing  over  many  short  hops  may  be  prohibitively 
slow.  If,  on  the  other  hand,  the  delay  induced  by  an  n-liop  connection  is  tolerable,  the  energy 
consumption  in  the  single-hop  case  can  be  further  reduced  by  exploiting  time  diversity  in 
the  form  of  retransmissions. 

4.1  Multiple  long- hop  transmissions 

For  a  fair  comparison,  we  assume  a  delay  constraint  of  n  timeslots  for  both  the  multi-hop 
(over  n  hops,  he.,  one  transmission  in  every  slot)  and  the  single- hop  schemes. 

We  consider  two  strategies  to  exploit  the  fact  that  n  timeslots  are  available  in  the  case 
of  a  single- hop  transmission  (over  an  n  times  larger  distance) ,  namely  the  schemes  with  and 
without  availability  of  channel  state  information  (CSI)  at  the  transmitter. 
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Without  CSI  With  CSI 


0.8  p  n  0.8  p 
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Figure  3:  Gain  of  the  n-transmission  schemes  without  (left)  and  with  (right)  channel  state 
information  over  a  single  transmission  in  the  first  timeslot  as  a  function  of  the  number  of 
available  time  slots  n  and  the  desired  reliability  Pee-  Points  with  gain  smaller  than  1  are 
not  shown. 


No  CSI:  Transmit  n  times.  For  a  block  fading  channel,  we  can  assume  independent 
fading  levels  at  each  time  slot.  Since  the  delay  constraint  gives  time  for  (at  most)  n  transmis¬ 
sions,  there  is  a  benefit  from  time  diversity  (selection  combining).  The  required  single- use 
reception  probability  Pee, i  is  then  given  by 

Pee,  i  =  1  —  (1  —  Pee)71  ■  (43) 


Compared  with  the  single-transmission  case,  this  leads  to  an  energy  gain  of 

log  Pee,  i 


G  = 


n  log  Pee  ’ 


(44) 


GNda 


since  from  (6),  the  necessary  transmit  power  in  the  single-transmission  case  is  iog  pEEi 
whereas  for  the  multi-transmission  case,  it  is  —  . 

If  we  assume  a  simple  ARQ  scheme,  where  a  short  ACK  or  NACK  packet  is  sent  im¬ 
mediately  after  the  receipt  of  the  data  packet,  this  gain  is  almost  doubled  (and  the  average 
delay  cut  in  half),  since  after  a  successful  transmission,  the  packet  does  not  have  to  be  re¬ 
transmitted  any  longer. 


With  CSI:  A  single  opportunistic  transmission  when  the  channel  is  best. 

With  CSI  at  the  transmitter,  the  packet  can  be  scheduled  to  be  transmitted  when  the  fading 
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Without  CSI 


With  CSI 


Figure  4:  Gain  of  the  schemes  without  (left)  and  with  (right)  channel  state  information  over 
a  multi-hop  scheme  as  a  function  of  the  number  of  available  time  slots  n  and  the  desired 
reliability  Pee ■  The  path  loss  exponent  ct  is  3.  Points  with  gain  smaller  than  1  are  not 
shown. 


level  is  best.  In  other  words,  the  best  time  slot  out  of  n  is  picked  for  a  single  transmission  of 
the  packet.  Note  that  if  the  delay  constraint  n  is  not  hard  but  an  average  constraint,  then 
this  scheme  may  choose  from  2 n  slots.  The  cumulative  distribution  of  the  maximum  of  n 
fading  random  variables  is  Fn(x)  =  (1  —  e~x!R)n  with  R  =  Pod~a.  From 


Pee  —  1  —  (1  —  e 


9JV 

R 


R  = 


GN 


ln(l  —  (1  —  Pee)u ) 


(45) 


we  see  that  the  reception  probability  increases  in  the  same  way  as  in  the  multiple  transmission 
case  but  with  only  one  transmission.  So,  this  strategy  has  an  n  times  higher  gain  than  the 
one  without  CSI. 

Figure  3  shows  the  gains  for  both  strategies  (as  a  function  of  the  number  of  transmissions 
n  and  the  desired  probability)  over  a  single-transmission  scheme.  For  small  Pee  and  large 
n,  the  gain  drops  below  1  for  the  scheme  without  CSI  (see  the  left  plot). 

Figure  4  compares  the  single-hop  retransmission-based  strategies  (n  transmissions)  with 
conventional  multi- hop  routing  over  n  hops  for  a  =  3.  It  can  be  seen  that  the  single- hop 
schemes  often  outperform  multi-hop  routing  if  the  same  delay  is  permitted.  Since  the  factor 
n  between  the  strategies  with  and  without  CSI  corresponds  to  an  increase  of  the  path  loss 
exponent  by  one,  we  find  in  general 


Gcsi(a)  —  fjNCSl(a  —  1)  • 


(46) 
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So,  the  left  plot  in  Fig.  4  shows  the  gain  Gcsi  for  a  =  4. 

Note  that  in  this  analysis,  we  did  not  take  into  account  the  higher  path  efficiency  of  the 
single-hop  scheme.  So,  the  effective  gain  is  slightly  bigger. 


4.2  Multiple  transmissions  in  the  short-hop  case 

If  the  delay  constraints  are  not  tight,  then  there  is  time  for  retransmissions  even  for  the 
short- hop  routing  scheme.  Indeed,  most  wireless  link  layer  protocols  are  based  on  ARQ, 
i.e.,  acknowledgments  and  retransmissions  are  employed  to  ensure  a  satisfactory  reliability. 
A  given  desired  link  reliability  Pl  can  be  achieved  by  a  single  transmission  or  multiple 
transmissions  at  lower  power.  The  following  Proposition  addresses  the  question  which  scheme 
is  energy-optimum. 


Proposition  4  (Optimum  power  level.)  The  energy- optimum  transmit  power  level  P/fvt 
to  achieve  a  link  reliability  Pl  >  1/2  is  given  by 


DoPt  _  ®Nda 

0  "  In  2 


(47) 


the  corresponding  single-transmission  reception  probability  is  p°pt  =  1/2,  and  the  necessary 
average  number  of  transmissions  is 10  nt  =  —  log2(l  —  Pl). 

Proof:  The  average  number  of  transmissions  to  achieve  Pl  is  given  by 


nt  = 


log(l-PL) 
log(l  -Pr) 

1 


Pr  <  Pl 
Pr  A  Pl 


So,  the  total  energy  consumption  is 


E(P0)  =ntP0=  bg(1  Ptl  -  P0 


log  ( 1  —  e  pod 


which  is  minimized  at 

D°Pt  •  CVD^  ®Nda 

P0  =  arg  mm  E(P0)  =  -  -  , 
Po 

corresponding  to  p°pt  =  1/2,  and,  in  turn,  nt  =  —  log2(l  —  Pl). 
10log2(-)  is  the  binary  logarithm,  while  log(-)  denotes  any  logarithm. 


(48) 


(49) 


(50) 
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Note  that  the  optimum  does  not  depend  on  PL.  The  factor  l/ln2  «  1.44  corresponds  to 
1.6dB,  hence  the  transmit  power  that  minimizes  the  overall  energy  consumption  is  1.6dB 
above  QNda.  The  minimum  energy  value  is  then  given  by 


With  £(P0single) 


E(P°pt)  =  ONda 


—  ln(l  —  Pl) 
(In  2)2 


d°iQN  ii 

lnp^  ,  the  energy  gam  is 


(51) 


£(P“ngfe) 


log2(l  -  Pl)  iog2  Pl  , 


(52) 


The  reduction  in  transmit  power  is  —  In  2/  In  Pi.  For  Pl  ~  1,  this  is  approximately  In  2/(1  — 
Pl)  or  In  2  divided  by  the  desired  packet  loss  rate.  The  gain  then  is  inversely  proportional  to 
the  packet  loss  rate  times  the  logarithm  of  the  packet  loss  rate.  For  Pl  =  99%,  for  example, 
the  reduction  in  transmit  power  is  ~  100  In  2  ~  69,  the  gain  is  slightly  bigger  than  lOdB, 
and  nt  ~  6.64.  So,  the  reduction  in  transmit  power  more  than  compensates  for  the  higher 
number  of  transmissions.  Of  course,  the  single-transmission  scheme  and  the  energy-optimum 
scheme  are  just  the  two  extreme  possibilities  in  the  energy-delay  trade-off. 

Again,  to  be  fair,  the  long-hop  scheme  should  be  permitted  the  same  delay.  For  the  case 
without  CSI,  if  the  short-hop  scheme  routes  over  n  hops  with  an  average  of  nt  transmissions 
at  each  hop,  the  long- hop  scheme  may  use  nnt  timeslots,  i.e.,  it  may  transmit  nnt  times 
instead  of  just  n  times.  The  required  reliability  is  Pee  =  P£ ,  so  we  bud  that  the  energy 
consumption  for  n  transmissions  is 


Pn  ^ 


QNd'a 

—  ln(l  —  (1  —  Pee )") 


(53) 


where  d'  denotes  the  distance  over  this  long  hop.  Permitting  nnt  transmissions  at  lower 
transmit  power,  we  have 


^  ©AhT 

Print  TITl-i  i  > 

—  ln(l  -  (1  -  Pee)™') 


(54) 


which  results  in  an  energy  gain  of 


En  _  1  log(l  ~  (1  ~  Pee)*"*) 
Print  nt  log(l  —  (1  —  Pee)u) 


(55) 


Since  we  know  that  the  optimum  single-transmission  reception  probability  is  1/2,  this  gain 
is  smaller  than  1  if  1  —  (1  —  Pee)™  %  1/2.  Therefore,  only  for  high  Pee  or  small  n,  there 
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is  a  benefit  in  transmitting  nnt  times.  In  contrast,  the  scheme  with  CSI  available  at  the 
transmitter  always  benefits  from  a  larger  number  of  timeslots. 

In  summary,  for  very  loose  delay  constraints,  short-hop  schemes  derive  more  benefit 
from  retransmissions.  Since  there  is  an  optimum  number  of  transmissions,  the  long-hop 
scheme  cannot  benefit  from  having  a  very  large  number  of  timeslots  available  with  this  simple 
retransmission  scheme  (unless  CSI  is  available).  In  this  case,  the  available  time  should  rather 
be  exploited  by  using  a  stronger  channel  code,  thereby  lowering  the  SINR  threshold  0. 

5  Concluding  Remarks 

We  have  offered  a  fresh  look  at  the  routing  problem  in  random  wireless  networks  by  taking 
into  account  Rayleigh  fading,  end-to-end  packet  delivery  probabilities,  node  distributions, 
and  path  efficiencies.  The  main  result  is  that,  from  an  energy  point  of  view,  routing  over 
many  short  hops  is  not  as  beneficial  in  a  Rayleigh  network  as  it  seems  to  be  for  the  deter¬ 
ministic  “disk  model” .  If  we  impose  a  delay  constraint  that  is,  for  a  fair  comparison,  loose 
enough  such  that  it  can  be  met  by  short-hop  routing,  long-hop  schemes  may  actually  save 
energy. 

Apart  from  energy  and  delay,  there  are  other  important  practical  issues  that  determine 
the  overall  performance  of  a  routing  scheme: 

Interference.  The  different  transmit  power  levels  of  the  different  routing  schemes  will 
change  the  interference  by  which  other  ongoing  transmissions  will  be  affected.  However,  if 
all  nodes  use  a  long-hop  strategy  rather  than  a  short-hop  strategy,  their  relative  power  levels 
will  stay  the  same,  and  the  interference  part  in  the  reception  probability  (5),  pJr,  remains 
constant.  Even  if  an  individual  source  node  chooses  to  adopt  a  long- hop  strategy,  the  total 
transmission  time  will  be  shorter,  which  may  offset  the  increase  in  power.  In  general,  it 
is  not  clear  whether  multiple  short  hops  or  a  single  long  hop  causes  more  interference  [10]. 
Total  radiated  energy  may  be  an  indicator. 

Diversity  schemes.  Path  and  cooperative  diversity  have  been  shown  to  enhance  the 
performance  of  wireless  networks  [19,23].  In  nearest-neighbor  routing,  there  is  little  room  to 
exploit  cooperative  strategies.  If  longer  hops  are  used,  a  number  of  intermediate  nodes  may 
overhear  the  transmission  and  assist  by  relaying  the  packet  if  needed  (provided  that  they 
are  not  asleep).  In  [4],  the  number  of  disjoint  and  independent  paths  are  investigated.  It 
has  been  shown  that  long-hop  routing  does  not  decrease  the  number  of  independent  paths 
but  increases  the  number  of  disjoint  paths. 
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Receiver  power  consumption  and  sleep  modes.  The  energy  consumption  for  packet  re¬ 
ception  may  be  substantial.  So,  nodes  that  are  not  part  of  an  active  route  (or  cooperative 
scheme)  should  be  put  to  sleep.  Long- hop  routing  permits  a  more  aggressive  use  of  sleep 
modes  than  short-hop  routing. 

Amplifier  characteristics.  If  the  efficiency  of  the  power  amplifier  depends  on  the  radiated 
power,  then  the  routing  scheme  should  ensure  that  the  PA  mostly  operates  close  to  its 
saturated  power  Pmax  [24],  In  this  case,  the  variance  of  the  distances  becomes  important, 
and  nearest- neighbor  routing  should  be  avoided,  as  pointed  out  in  [25]. 

Route  acquisition  time.  Here,  the  long-hop  schemes  have  a  clear  advantage,  which  is 
particularly  important  for  on-demand  routing  algorithms  [26]. 

Route  breaks  due  to  mobility  and  node  failure.  With  less  relaying  nodes,  the  probability 
that  new  routes  have  to  be  discovered  due  to  failing  or  moving  nodes  is  smaller.  Hence  the 
robustness  of  the  network  is  bigger.  The  availability  of  intermediate  nodes  also  increases  the 
network’s  capability  of  route  repair  and  maintenance. 

Energy  and  delay  balancing.  Often,  the  data  gathered  by  a  sensor  networks  has  to  be 
delivered  to  a  single  point,  the  observer  or  base  station.  Using  nearest-neighbor  routing,  the 
lifetime  of  a  node  close  to  the  base  station  is  much  shorter  than  the  lifetime  of  a  node  far 
away  due  to  traffic  accumulation  [27].  With  long- hop  routing,  the  energy  consumption  is 
better  balanced.  Instead  of  just  a  few  critical  nodes,  there  are  many  more,  which  results  in 
an  increase  of  the  network  lifetime. 

In  conclusion,  long-hop  strategies  have  a  clear  advantage  in  most  of  these  categories,  in 
addition  to  being  competitive  in  terms  of  energy. 
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Abstract 

The  lifetime  of  wireless  sensor  network  is  crucial,  since  autonomous  operation  must  be  guar¬ 
anteed  over  an  extended  period.  Usually  all  the  sensor  data  has  to  be  forwarded  to  an  observer 
or  base  station  via  multihop  routing,  which  causes  the  traffic  pattern  in  the  network  to  be  highly 
nonuniform  and  puts  a  high  burden  on  the  sensor  nodes  close  to  the  observer.  We  propose  and 
analyze  five  strategies  that  balance  the  energy  consumption  of  the  nodes  to  increase  the  lifetime 
of  the  network  substantially:  distance  variation  with  and  without  rate  adaptation,  compression, 
routing,  and  reliability  balancing.  Since  several  of  these  schemes  affect  the  transmission  delay 
significantly,  remarks  on  their  delay  performance  are  also  given.  The  analyses  are  based  on  a 
non- fading  channel  model  with  additive  white  Gaussian  noise  (AWGN)  and  a  Rayleigh  fading 
model. 

Keywords  —  Sensor  networks,  radio  communication,  AWGN  channels,  packet  switching. 
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1  Introduction 


Large-scale  networks  of  integrated  wireless  sensors  become  increasingly  tractable,  as  advances  in 
hardware  technology  and  engineering  design  have  led  to  dramatic  reductions  in  size,  power  con¬ 
sumption,  and  cost  for  digital  circuitry,  wireless  communications,  and  MEMS.  This  enables  very 
compact  and  autonomous  nodes,  each  containing  one  or  more  sensors,  computation  and  communi¬ 
cation  capabilities,  and  a  power  supply.  Multihop  routing  is  typically  used  to  reduce  the  transmit 
power  and,  consequently,  increase  the  battery  lifetime  and  decrease  the  interference  between  the 
nodes,  thereby  allowing  spatial  reuse  of  the  communication  channel. 

Wireless  sensor  networks  [1,  2]  differ  from  other  types  of  multihop  wireless  networks  by  the 
fact  that,  in  most  cases,  the  sensor  data  has  to  be  delivered  to  a  single  sink,  the  observer  or  base 
station  (BS).  Clearly,  one  of  the  primary  concerns  is  the  lifetime  of  the  network.  Although  different 
definitions  of  lifetime  exist  [3] ,  a  sensor  network  certainly  has  to  be  considered  “dead”  whenever  it  is 
no  longer  able  to  forward  any  data  to  the  BS.  We  assume  that  every  sensor  node  in  the  network  has 
an  equal  probability  of  generating  data  packets  that  are  forwarded  to  the  BS  via  multihop  routing 
using  other  sensor  nodes  as  relays.  Apparently,  the  burden  on  the  nodes  close  to  the  base  station  is 
considerably  higher  than  on  the  nodes  that  are  far  away.  Figure  1  depicts  a  possible  arrangement  of 
sensor  nodes  and  identifies  the  most  critical  nodes  in  the  network.  Without  appropriate  measures, 
they  will  die  quickly,  rendering  the  network  useless.  This  is  a  crucial  problem  in  sensor  networking 
that  has  received  surprisingly  little  attention. 

While  sensor  networks  are  the  most  prominent  examples  for  such  many-to-one  traffic,  there  are 
two  other  classes  of  networks  where  the  same  problems  occurs: 

•  Multihop  Extension  of  Wireless  Local  Area  Networks  (WLANs).  To  extend  the 
coverage  are  of  the  WLAN,  every  node  also  acts  as  a  router,  forwarding  packets  on  behalf  of 
other  nodes  that  are  further  away  from  the  BS  (or  access  point  or  gateway)  [4].  To  account 
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Figure  1:  A  sensor  network  with  a  base  station. 

for  user  mobility,  the  network  is  self-organizing  and  self-configuring,  similarly  to  a  pure  ad 
hoc  network.  In  [5],  the  authors  show  that  the  per-node  capacity  decrases  in  proportion  to 
the  number  of  nodes,  but  they  do  not  address  energy  issues. 

•  Multihop  Cellular  Networks.  In  [6],  a  new  architecture,  the  multihop  cellular  network 
(MCN)  has  been  proposed  as  a  viable  alternative  to  the  conventional  single-hop  cellular 
network  (SCN)  by  combining  the  features  of  SCNs  and  ad  hoc  networks.  The  BS  in  MCN  is 
acting  as  a  special  relay  or  gateway  to  support  inter-cell  communications  or  connections  to 
the  backbone  telephone  network.  While  [6]  permitted  direct  communication  between  mobile 
stations  (MSs)  within  a  cell,  this  was  excluded  in  subsequent  work  [7-10],  where  all  MSs  are 
restricted  to  communicate  with  the  BS  only,  z.e.,  a  multihop  many-to-one  or  one-to-many 
scheme.  In  other  words,  the  traffic  of  one  MS  can  be  forwarded  by  other  MSs  to  reach  the 
BS. 

In  this  paper,  we  propose  and  discuss  strategies  to  ensure  maximum  lifetime  of  the  network  by 
balancing  the  energy  load  as  equally  as  possible. 

The  analysis  is  based  on  channel  models  with  and  without  Rayleigh  fading  and  AWGN  (additive 
white  Gaussion  noise).  Such  models  characterize  the  wireless  link  more  accurately  than  the  “disk 


80 


model”  that  is  often  used,  where  it  is  assumed  that  the  radius  for  a  successful  transmission  of 
a  packet  has  a  fixed  and  deterministic  value,  irrespective  of  the  condition  and  realization  of  the 
wireless  channel  [11-14].  Such  simplified  link  models  ignore  the  probabilistic  nature  of  the  wireless 
channel  and  the  fact  that  the  signal-to-noise-and-interference  ratio ,  that  determines  the  success  of 
a  transmission,  is  a  random  variable.  The  volatility  of  the  channel  cannot  be  ignored  in  wireless 
networks  [3,15];  the  inaccuracy  of  “disk  models”  has  also  been  pointed  out  in  [16]  and  is  easily 
demonstrated  experimentally  [17,18].  The  (non-fading)  AWGN  model  used  in  this  paper  relates 
the  signal-to-noise  ratio  (SNR)  and  the  capacity  of  a  channel,  whereas  the  Rayleigh  fading  model 
relates  the  average  SNR  with  the  success  of  a  transmission  at  a  given  rate.  The  end-to-end  packet 
delivery  probability  over  a  multihop  route  is  the  product  of  the  link-level  reception  probabilities. 

2  Link  Models 

2.1  AWGN  channels 

The  complex  AWGN  channel  has  a  capacity  C  =  log2(l+7),  where  7  =  LPod~a /Nq  is  the  signal-to- 
noise  ratio  (SNR),  Pq  the  transmit  power,  d  the  link  distance,  a  the  path  loss  exponent,  Nq  the  noise 
variance,  and  L  some  constant  that  depends  on  the  antenna  gains  and  wavelength.  For  simplicity, 
we  will  henceforth  assume  L  —  1.  In  the  presence  of  normally  distributed  interference,  the  SNR 
can  be  augmented  to  a  signal-to-noise-and-interference  ratio  (SINR)  by  adding  the  interference  to 
the  noise  term.  We  assume  that  the  channel  code  is  capacity-achieving,  i.e.,  that  we  can  transmit 
reliably  at  rate  R  —  C.  Note  that  for  a  practical  channel  code  with  finite  block  length,  we  necessarily 
have  R  <  C  and/or  error-prone  transmissions;  however,  this  loss  in  throughput  (or  the  additional 
energy  expenditure)  does  not  change  our  comparative  results.  Given  P,  the  transmit  power  is 
Po  =  (2R  —  l)N0da.  If  the  packet  transmission  time  is  normalized  to  unity,  the  power  equals  the 
transmission  energy.  We  will  denote  by  Pq  the  energy  required  to  transmit  one  packet  at  rate  R 
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over  a  unit  distance,  i.e.,  Eq  =  (2R  —  1)Nq.  In  general,  E  =  daE$. 


2.2  Rayleigh  fading  channels 

This  model  is  based  on  a  flat  Rayleigh  block  fading  channel  [19].  The  transmission  is  successful  if 
the  SINR  7  at  the  receiver  is  above  a  certain  threshold  ©  that  is  determined  by  the  transmission 
rate,  the  communication  hardware,  and  the  modulation  and  coding  scheme  (normally  between  1 
and  100  or  OdB  and  20dB).  In  any  case,  0  >  2R  —  1  due  to  capacity  and  rate  considerations.  With 
the  assumptions  above,  the  received  power  Q  is  exponentially  distributed  with  mean  Q  =  Pod~a. 
The  following  theorem  proves  useful  for  the  analysis: 


Theorem  1  In  a  Rayleigh  fading  network,  the  reception  probability  pr  —  P[y  ^  ©]  can  be  factorized 
into  the  reception  probability  of  a  zero-interference  network  p ^  and  the  reception  probability  of  a 
zero-noise  network  pTr  as  follows: 


pr  =  exp 


V  p0d^aJ  Mi 


i 


f-V;,  fii  +  ©S©°’ 


(i) 


where:  do  is  the  distance  of  the  desired  transmitter;  d{,  1  ^  i  ^  k  are  the  distances  of  the  k 
interferers ;  and  Pi  are  their  transmit  power  levels. 


Proof:  Let  Qo  denote  the  received  power  from  the  desired  source  and  Qi,  i  —  1 

the  received  power  from  k  interferers.  All  the  received  powers  are  exponentially  distributed,  i.e., 
PQiili)  —  1/Qie~ri^i ,  where  Qi  denotes  the  average  received  power  Qi  —  Pid^a.  Conditioned  on 
the  total  interference  I  —  J^i  —  1  kQi,  the  probability  of  correct  reception  is  a  simple  exponential, 
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so  we  have 


pr=E/[P[Qo^e(/  +  JVo)  \I ]] 


=E/ 


exp 


/  e(/  +  AT0)x 

V  Qo  > 


Jo  Jo  ^  Qo  '  i=1 


PQife)  dgi  •  •  -  dgfe, 


from  which  (1)  follows.  ■ 

Note  that  the  SNR  yw,  the  SIR  7/,  and  the  SINR  7  are  related  as  7  =  7w7//(7w  +  7/),  i.e., 
the  SINR  is  half  the  harmonic  mean  of  the  SNR  and  SIR,  or  SINR-1=SNR-1+SIR-1. 


This  allows  an  independent  analysis  of  the  effect  caused  by  noise  and  the  effect  caused  by 
interference.  The  focus  of  this  paper  is  put  on  the  noise,  i.e.,  on  the  first  factor  in  (1).  If  the  load 
is  light  (low  interference  and  small  collision  probability),  then  SIR^SNR,  and  the  noise  analysis 
alone  provides  accurate  results.  For  high  load,  a  separate  interference  analysis  has  to  be  carried 
out  [20-22] 1 . 

In  a  zero-interference  network,  the  reception  probability  over  a  link  of  distance  d  at  a  transmit 
power  Po,  is  given  by 

__0iVo_ 

Pr  :=  P[7iV  >  e]  =  e  .  (2) 


Solving  for  Po,  the  necessary  transmit  power  to  achieve  a  link  reliability  (or  reception  probability) 

pr  is  Po  =  —  daQNo/  lnpr.  It  is  assumed  that  the  packet  transmission  time  is  one  time  unit,  so 

the  energy  required  for  the  transmission  of  one  packet  over  a  distance  d  at  reliability  pr  is  equal  to 

Po.  Thus  for  the  Rayleigh  fading  channel,  we  define  Po  to  be  the  energy  for  a  transmission  over 

distance  1  with  reliability  pr,  i.e.,  Po  =  —  QNq/  In pr.  Again,  in  general,  P  =  daE$. 

1Note  that  power  scaling ,  i.e.,  scaling  the  transmit  powers  of  all  the  nodes  by  the  same  factor,  does  not  change 

the  SIR,  but  (slightly)  increases  the  SINR. 
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Figure  2:  A  one-dimensional  chain  of  wireless  sensor  nodes.  It  is  assumed  that  the  distances  sum 
up  to  N,  i.e.,  the  average  distance  is  1. 

3  Energy-Balancing  Strategies 

We  assume  that  every  sensor  node  generates  an  equal  amount  of  traffic  of  a  certain  arrival  rate 
that  is  relayed  to  the  base  station  along  the  shortest  route.  Since  optimum  routes  approximately 
follow  a  straight  line,  the  analyses  of  the  strategies  proposed  in  this  section  can  be  restricted  to 
one-dimensional  chains  of  TV  nodes,  as  shown  in  Fig.  2.  Without  loss  of  generality,  it  is  assumed 
that  the  internode  distances  sum  up  to  TV,  i.e.,  the  distance  between  node  1  and  the  BS  is  TV  and 
the  average  distance  is  1.  The  strategies  are  compared  with  the  simple  scheme  that  has  equal  node 
distances  d—  1,  equal  link  reception  probabilities  pr,  and  employs  nearest-neighbor  routing  (node 
i  transmits  to  node  i  +  1  for  i  =  1 , ...  ,7V  —  1,  and  node  TV  transmit  to  the  BS).  To  compare  the 
total  energy  consumption,  it  is  sufficient  to  calculate  the  energy  requirements  to  forward  one  packet 
from  each  of  the  nodes  to  the  BS.  For  the  simple  strategy,  the  total  energy  consumption  is 

Etot  =  (1  +  2  +  . . .  +  N)E0  =  (3) 

since  node  i  has  to  forward  i  packets.  For  the  lifetime,  the  critical  node’s  energy  consumption  has 
to  be  determined.  For  the  simple  strategy,  the  critical  node  is  node  TV,  and  i?max  =  En  —  NEq. 

Although  our  focus  in  on  the  energy  consumption,  we  also  consider  delay  aspects,  since  delay 
and  transmission  rates  are  closely  intertwined  with  energy.  Assuming  that  spatial  reuse  of  the 
communication  channel  permits  every  q- th  link  to  be  active  simultaneously,  the  total  number  of 
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timeslots  to  deliver  one  packet  for  each  node  to  the  BS  is  given  by 

D  =  qN-  ,  for  N  >  q  -  1 .  (4) 

For  TV  ^  g,  no  spatial  reuse  is  possible  and  the  delay  corresponds  to  the  total  number  of  transmis¬ 
sions  TV(TV  +  l)/2.  For  larger  TV,  thanks  to  the  spatial  reuse,  the  delay  grows  only  linearly  in  TV 
although  the  number  of  transmissions  increases  with  TV2. 


3.1  Distance  variation  without  rate  adjustment 


We  assume  nearest-neighbor  routing.  The  idea  is,  given  a  desired  rate  R  (AWGN  channel)  or  link 
reliability  pr  (Rayleigh  fading  channel),  to  ensure  energy-balancing  by  adjusting  the  distances  d{ 
between  the  nodes.  If  every  node  generates  one  packet,  node  i  has  to  forward  a  total  of  i  packets 
using  a  total  energy  of  E{  —  idfEo.  The  goal  E\  =  E2  =  . . .  =  Ejsj  requires  that  all  the  factors  idf 
be  identical,  and  the  sum  of  all  the  distances  is  TV.  We  find  d{  —  d\ i~1/a  and 


TV 


di  ~  ^ N  .1 


■~l/a 


Eli  i-1'* 


(5) 


To  obtain  more  insight,  we  derive  an  accurate  approximation  based  on  the  harmonic  mean- geometric 

v-i  • 


mean 


mean 


inequality.  Observe  that  d{  —  -i  1/a,  where  Tiiai)  =  TV  (Eli  Va0  ls  the  harmonic 

di  can  be  upperbounded  by  ^(i_1/a)i_1/a,  where  Q(ai)  —  ^nli^)1^  th e  geometric 


mean: 


dx  =  H(i1/a)  <  G(i1/a)  =  (TV!)^v  .  (6) 

With  Stirling’s  approximation  TV!  ^  Nn e~N \f2iiN  (which  is  within  1%  for  TV  >  8),  this  simplifies 
to 


TV 

d\  E  (  —  )  (27tTV) 


(7) 


The  geometric  mean  is  larger  than  the  actual  value  (harmonic  mean),  but  Stirling’s  approximation 


lower  bounds  the  factorial.  In  total,  (7)  is  a  tight  upper  bound,  as  illustrated  in  Fig.  3.  For  a  =  4, 
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Figure  3:  Internode  distances  for  distance  variation  strategy  without  rate  adjustment.  7V=10  nodes, 
a  —  2, 4. 

its  accuracy  is  within  3%  for  N  ^  100,  for  a  =  3,  it  is  within  5%  for  N  ^  60,  and  for  a  =  2 
within  10%  for  N  ^  30.  Dropping  the  second  factor  in  (7)  yields  the  even  simpler  expression 
d\  ~  (N/e)l/a ,  which  turns  out  to  be  a  tight  lower  bound  for  N  ;S  7a. 

The  energy  consumption  at  each  node  is  dfEo  ~  ( N/e)Eo .  So,  the  gain  (reduction)  in  total 
energy  consumption  is 

G'*~l-Nwikr2  =  ^mrT)K1-Veam°-  <8) 

The  gain  in  lifetime  is  considerably  higher,  namely  a  factor  of  e,  since  node  N  now  only  consumes 
N/e  energy  units  rather  than  N. 

This  strategy  leads  to  a  non-uniform  distribution  of  the  sensor  nodes,  which  may  not  be  de¬ 
sirable.  However,  the  distribution  is  not  too  far  from  uniform,  in  particular  for  high  path  loss 
exponents,  as  is  manifested  by  the  small  variance  of  d\.  For  N  —  10  and  a  =  2,  the  variance  is 
about  0.18.  For  a  =  3,4,5,  the  variances  are  0.073,  0.039,  0.024. 
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3.2  Distance  variation  with  rate  adjustment 


In  this  approach,  the  rate  of  transmission  is  adjusted  based  on  the  amount  of  data  to  be  forwarded. 
If  at  every  hop  the  full  amount  of  data  is  to  be  transmitted  in  a  single  timeslot,  the  rate  needs 
to  increase  linearly  with  i,  i.e.,  Ri  oc  i.  Since  the  transmit  power  should  be  the  same  for  all 
nodes,  the  SNR  7 *  =  cdT°  can  only  be  increased  by  decreasing  the  internode  distance.  From 
Ri  —  log2(l  +  cd~a )  =  iRx  and  c  =  d" 71  follows 


d?  =  cff  d i  =  d1 
7 * 


2Rl  -  1 
1  2*ri  -  1  ' 


(9) 


The  distances  dt  sum  up  to  N,  so  d\  =  N /  Xih(7i hiY^a  and 

dr  _  ^  m 
EZ i7._  /°  £ii(2'Kl  -  1)-V“ 

The  rate  R\  of  the  first  link  is  a  design  parameter.  In  the  following,  approximations  for  d{  and  the 
energy  consumption  are  derived,  discussing  the  high  rate  ^  1)  and  the  low  rate  (R\  ~  1  /N) 
cases  separately. 

High  rate:  Since  2lRl  ^  1  for  most  i,  the  sum  in  the  denominator  of  (10)  can  be  approximated 
as  a  geometric  sum,  i.e.,  J^iLi  2~lRl/a  —  (1  —  2~RlN/a)/(2Rl^a  —  1),  which  yields 


_  N  2Rl/a  -  1 

~  (2iRl  —  1  )Va  1  —  2~RlN/a 


(ii) 


The  term  2~RlN/a  is  negligible  for  large  N/a,  which  shows  that  the  distances  increase  approximately 
linearly  with  N  and  that  goes  to  zero  quickly  as  N  grows.  For  R\  =  1  (which  implies  Eq  =  N$), 
we  obtain  d\  ^  N(21^a  —  1)  and  djy  «  N2~n/ol(21/ol  —  1).  This  case  is  illustrated  in  Fig.  4.  The 
energy  consumption  at  each  node  is  E  =  E\  —  d^No  =  Na(21/a  —  l)a7Vo,  so  the  total  energy  grows 
with  7Va+1,  and  the  total  delay  is  N. 

Low  rate:  Since  2lRl  —  1  1  for  most  i,  we  employ  the  first  order  Taylor  approximation  2lRl  — 

1  ~  ii?iln2  and  the  harmonic  mean- geometric  mean  inequality.  The  geometric  mean  Q((2lRl  — 

87 


Figure  4:  Internode  distances  for  distance  variation  with  rate  adjustment  for  R\  —  1  (high  rate), 
Ar=10  nodes,  a  =  2,4. 

l)1//a)  (^x  ln2)1/a(A^!)1/(aiV).  Applying  a  crude  version  Stirling’s  formula,  N\  «  ( N/e)N:  and 

noting  that  ln2/e  ~  1/4,  we  obtain  the  simple  approximation  Q  ~  (i?iA^/4)1/a.  Therefore 

*"(4(25”  -jy)  “  (2iriV)rig  ,  (12) 

which  is  shown  in  Fig.  5  together  with  the  exact  value  from  (10).  The  energy  consumption  at  each 
hop  is 

Ei  =  E  =  d?E0  »  —A ./Vo  •  (13) 

For  R\  =  1/iV,  this  simplifies  to  E  «  iVo/4,  so  the  total  energy  consumption  grows  only  linearly 
with  N.  On  the  other  hand,  the  delay  is  N 2  for  this  low-rate  scheme. 

Remarks: 

•  The  low-rate  case  looks  quite  similar  to  the  case  without  rate  adaptation.  The  variance  is 
just  slightly  larger  in  the  low-rate  case,  namely  25%.  In  the  high-rate  case,  the  variance  may 
be  unacceptably  high,  hence  it  would  be  difficult  to  achieve  good  coverage. 
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Figure  5:  Internode  distances  for  distance  variation  with  rate  adjustment  for  R\  =  1/N  (low  rate), 
Ar=10  nodes,  a  =  2,4. 

•  Between  no  rate  adaptation  and  full  rate  adaptation,  there  is  a  host  of  other  possibilities. 
Depending  on  the  desired  performance  on  the  energy/delay  curve,  the  rates  of  the  nodes  can 
be  adapted. 

3.3  Balanced  data  compression 

If  the  internode  distances  are  all  equal  ( d{  —  1),  the  incoming  data  flows  may  be  compressed  to 
ensure  that  every  node  in  the  path  has  to  transmit  the  same  number  of  packets.  The  justification 
for  this  approach  is  the  correlation  between  the  sensor  readings  of  neighboring  nodes.  Hence,  data 
fusion  may  be  applied  to  reduce  the  amount  of  data  to  be  transmitted.  The  goal  is  to  ensure  that 
every  packet  experiences  the  same  compression  factor,  irrespective  of  its  origin.  At  each  node  i, 
the  incoming  data  is  compressed  by  a  factor  of  oq,  while  the  locally  generated  data  is  compressed 
by  bi  =  1  —  di.  Equal  compression  is  achieved  when  bi  =  1/z,  since  the  total  compression  factor  for 
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a  packet  generated  at  node  i  is 


N  1  N  1  , 

«=(»,■  n  “<=7'  n  i}~k)  =  N  vi-  (i4) 

k=i+ 1 

This  way,  every  node  transmits  only  one  packet,  so  the  total  energy  consumption  is  NEq ,  and  the 
gain  in  lifetime  is  a  factor  of  TV.  On  the  other  hand,  the  accuracy  of  the  received  data  will  suffer. 
The  total  transmission  delay  is  TV,  but  the  computational  or  processing  delay  also  needs  to  be  taken 
into  account. 


3.4  Routing 


We  again  assume  equal  distances  d{  —  1  between  the  nodes  but  no  longer  restrict  the  network  to 
strict  nearest-neighbor  routing.  Instead,  we  assume  that  node  i  transmits  the  locally  generated 
traffic  to  the  next  neighbor  with  probability  oq  and  directly  to  the  sink  with  probability  bi  =  1  —  a{. 
Relay  traffic  will  always  be  forwarded  to  the  next  node.  The  goal  is  to  choose  a\  to  achieve  energy 
balancing. 

All  energies  in  the  following  derivation  are  normalized  by  E q.  The  energy  consumption  at  node 


i  is 


Ei=(N  -i  +  iybi  +  Y^ak  (15) 

k= 1 

i—  1 

=i  +  ((N  -  i  +  1)Q  -  l)bi  -  .  (16) 

k= 1 


&7V  =  0,  as  node  N  always  transmits  directly  to  the  BS.  Ejy  —  TV  —  b\  —  62  —  •  •  •  —  b/v-i  — 
a\  +  Ci2  +  •  •  •  +  ajsi.  The  TV  —  1  unknowns  can  thus  be  determined  by  solving 


Na  1 

.  1 

bi 

N  —  1 

0  (N-  1)"  . 

.  1 

b2 

= 

N  —  2 

0  0 

. .  2“ 

bN- 1 

1 

(17) 
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to  equalize  the  energy  consumption  at  every  node  to  Ejy.  For  a  network  with  5  nodes,  the  values 
for  are  0.0301,  0.0438,  0.0694,  0.1250,  0  for  a  =  3.  Since  some  packets  are  routed  to 

the  BS  in  a  single  hop,  the  total  energy  consumption  is  bigger  than  in  the  simple  strategy.  For 
N  =  10,  the  additional  energy  consumption  is  between  60%  (a  =  2)  and  80%  for  (a  =  5).  On  the 
other  hand,  there  is  a  slight  gain  in  network  lifetime,  as  the  direct  routing  of  some  of  the  packets 
reduces  the  burden  on  node  TV.  Therefore  the  sum  of  the  af  s  is  smaller  than  TV.  For  10  nodes, 
the  increase  in  lifetime  is  0.5%  for  a  =  5  and  14%  for  a  =  2.  For  TV  — ►  oo,  the  total  additional 
energy  consumption  reaches  100%  and  the  gain  in  lifetime  vanishes2.  Hence  this  strategy  is  useful 
for  smaller  TV,  or  if  there  are  a  some  high-priority  packets  that  have  to  be  delivered  with  minimum 
delay.  The  average  delay  for  packets  generated  at  node  i  is  oq(TV  —  i  +  1)  +  (1  —  ai)  =  a^(TV  —  i)  +  1. 

3.5  Equalization  of  the  end-to-end  reliability 

While  the  last  to  strategies  are  applicable  to  both  AWGN  and  fading  channels,  here  we  focus 
only  on  Rayleigh  fading  channels.  So  far,  we  ignored  the  fact  that  the  end-to-end  reliability  of 
a  multihop  path  is  the  product  of  the  reception  probabilities  of  the  links.  With  constant  link 
reception  probabilities  pr,  a  packet  traveling  over  k  hops  only  arrives  at  the  sink  with  a  probability 
Pkr- 

We  investigate  a  strategy  where  every  packet,  irrespective  of  how  far  away  from  the  base  sta¬ 
tion  it  is  generated,  has  the  same  end-to-end  probability  pee  to  arrive  at  the  base  station.  This 
equal- end-to-end-probability  or  reliability  balancing  strategy  is  henceforth  referred  to  as  strategy  A, 
whereas  the  simple  equal-power  strategy  is  denoted  as  strategy  B. 

Analysis  of  strategy  A.  If  the  desired  end-to-end  reliability  is  pee,  the  link  probability  in 

2It  is  easily  established  that  bk  ^  (N  —  k)  •  (. N  —  k  +  l)_a.  With  En  >  (1  —  &fc),  we  get  En  >  Ek=i  t  — 

k  •  (k  +  l)-a.  For  a  ^  2,  the  terms  in  the  sum  approach  1  for  large  k ,  thus  En  — >  N  for  large  TV,  which  is  the  same 
as  in  the  simple  strategy. 
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1  fk 

a  k— hop  connection  is  pLk  —  Pee*  Accordingly,  the  transmit  power  at  each  hop  in  a  k— hop 
connection  with  equal  distances  d{  —  1  is3 


pA  _ 


©7V0 


~HPee, 


k  • 


©7V0 


(18) 


In  pee 

Clearly,  a  transmission  over  k  hops  requires  a  k  times  higher  transmit  power  level  (at  each  hop)  than 
a  transmission  over  one  hop  with  the  same  probability.  Thus  the  total  energy  needed  for  a  packet 
to  travel  from  node  N  —  k  +  1  to  the  base  station  is  proportional  to  k2 .  Note  that  a  single  large 
hop  of  length  k  would  require  an  energy  proportional  to  ka.  Let  Eq  denote  the  energy  required  to 
transmit  one  packet  over  one  hop  of  distance  1  with  a  reliability  of  pee?  Le.,  Eq  :=  QNq/(—  In  pee)- 
Using  nearest-neighbor  routing,  the  energy  consumption  is: 


Node 

Energy  consumption 

(in  units  of  E$ ) 

i 

N 

=  N 

2 

N  +  (N-  1) 

=  2N  -  1 

3 

(2 N  -  1)  +  (N  -  2) 

CO 

1 

CO 

II 

4 

41V -6 

i 

iN 

The  total  energy  consumption  (assuming  one  packet  is  generated  at  every  node)  is 


UA 


N 


tot 


=y im- 


i(i  - 1) 


Eo 


=N  ■ 


N(N  +  1) 


0  +  1  +  3  +  6  +  ...+ 


N(N  —  1) 


N(N  +  1)  /  N(N  —  1)  N(N-l)(N-2) 

2  V  2  +  6 

_iV3  N 2  N 

~~3~ +  ~Y  +  V’ 


(19) 


3 Note  that  this  implies  that  a  single  node  uses  different  power  levels  that  depend  on  the  origin  of  a  packet.  The 
power  levels  are  assigned  to  flows,  not  to  nodes. 
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where  we  have  exploited  the  fact  that  the  sum  S  in  the  second  line  is  an  arithmetic  series  of  order 
2  with  qo  —  0,  Ago  —  A2go  =  1- 

Analysis  of  strategy  B.  Here,  all  nodes  transmit  at  a  fixed  power  level,  corresponding  to 
a  fixed  link  reception  probability  pr.  For  a  fair  comparison,  it  is  assumed  that  the  application 
dictates  a  minimum  end-to-end  reliability  pee-  Packets  generated  at  node  i  have  to  travel  over 
k  :=  N  —  i  +  1  hops,  resulting  in  an  end-to-end  reliability  of  pjf.  To  ensure  that  the  packets  from 
the  farthest  node  (node  1)  arrive  with  probability  pee?  a  link  reception  probability  of  pr  =  p^?  is 
required.  The  energy  per  hop  is  in  this  case  is  (cf.  (3)) 

#0  =  daQN0/(-  In  Pl)  =  N  ■  daQN0/{- In pEE)  =  NE 0A  . 


The  total  energy  consumption  of  the  network  (assuming  one  packet  is  generated  at  every  node)  is 


Kt  = 


_N(N  +  1)fb_(N3  |  iV2 


V  2 


EZ 


(20) 


Comparison.  The  ratio  between  the  energy  consumption  of  the  two  strategies  is 

771A  N2  1  N_  _j_  1  r\  -1 

^tot  _  3  '  2  '  6  _  f  1  (cyx  \ 

®«o,  ¥  +  T  3  3JV' 

For  large  A,  this  ratio  approximates  2/3,  hence  the  gain  in  total  energy  consumption  for  the 
reliability  balancing  strategy  is  33%.  More  important  and  more  significant  is  the  gain  in  network 
lifetime,  which  is  determined  by  the  lifetime  of  the  critical  node  N.  In  strategy  A,  the  energy 
consumption  at  node  N  is  E ^  =  Eq(N2  +  iV) /2,  whereas  for  B,  it  is  E ^  —  E^N  =  E^N2.  The 
ratio  is  2 N/(N  +  1),  thus  the  gain  in  network  lifetime  approaches  2  for  large  N.  Figure  6  compares 
the  two  strategies.  For  strategy  B,  the  energy  consumption  increases  linearly  with  the  node  number, 
and  the  end-to-end  probability  for  a  packet  generated  at  node  i  increases  monotonically.  For  scheme 
A,  the  energy  consumption  is  more  balanced,  and  the  end-to-end  probability  is  constant. 
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Figure  6:  Comparison  of  the  equal-power  strategy  B  (dashed)  and  the  equal-end-to-end-probability 
strategy  A  (solid)  for  pee  =  0.6.  The  left  plot  shows  the  relative  energy  consumption  when  every 
node  generates  one  packet,  the  right  plot  the  end-to-end  probabilities  for  traffic  generated  at  nodes 
1,2,. ..,6. 
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4  Concluding  Remarks 


For  sensor  networks,  where  the  destination  of  all  the  information  gathered  at  the  sensor  nodes  is 
a  single  base  station,  the  traffic  pattern  is  highly  non-uniform,  since  the  nodes  close  to  the  base 
station  have  to  relay  all  the  data  packets.  Consequently,  those  nodes  are  the  first  to  run  out  of 
battery,  thereby  restricting  the  lifetime  of  the  network.  Several  strategies  have  been  proposed  to 
balance  the  energy  consumption,  each  of  them  having  their  application-dependent  strengths  and 
weaknesses.  The  strategies  are  not  mutually  exclusive,  hence  two  or  more  may  be  combined  into 
hybrid  schemes.  For  example,  since  the  distance  variation  strategy  does  not  provide  reliability 
balancing,  both  could  be  applied  jointly  to  achieve  both  energy  and  reliability  balancing. 

The  analyses  are  based  on  a  AWGN  channel  model  and  on  a  Rayleigh  fading  model.  Under 
Rayleigh  fading,  noise  issues  and  interference  issues  can  be  analyzed  separately,  and  that  the 
reception  probability  is  an  exponential  function  of  the  transmit  power.  This  non-zero  probability 
of  a  packet  loss  even  if  nodes  are  close  (or  within  “transmission  range”)  is  often  ignored.  It  entails 
that  a  transmission  over  k  hops  requires  a  k  times  higher  transmit  power  at  each  hop  to  guarantee 
the  same  end-to-end  probability  as  a  single-hop  transmission.  An  important  consequence  is  that  the 
energy  benefit  of  multihop  routing  becomes  less  significant.  Indeed,  since  the  energy  consumption 
is  proportional  to  k2  in  the  hop-by-hop  transmission  and  proportional  to  ka  in  a  single-hop  scheme 
(see  Section  3.5),  the  benefit  vanishes  for  a  —  2. 
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Abstract — For  multihop  wireless  networks,  a  fundamental 
question  is  whether  it  is  advantageous  to  route  over  many  short 
hops  (short-hop  routing)  or  over  a  smaller  number  of  longer  hops 
(long-hop  routing).  Short-hop  routing  gained  a  lot  of  support,  and 
its  proponents  mainly  produce  two  arguments:  reduced  energy 
consumption  and  less  interference.  Both  arguments  stem  from  an 
oversimplified  analysis  that  is  based  on  inaccurate  channel  models 
and  neglects  delay,  end-to-end  relability,  bias  power  consumption, 
the  impact  of  channel  coding,  mobility,  and  routing  overhead.  In 
this  paper,  we  shed  more  light  on  these  issues  by  listing  twelve 
reasons  why  short-hop  routing  is  not  as  beneficial  as  it  seems  to 
be.  The  conclusion  is  that  for  many  networks,  long-hop  routing 
is  in  every  aspect  a  very  competitive  strategy. 

I.  Introduction 

For  certain  wireless  networks,  such  as  ad  hoc  and  sensor 
networks,  a  fundamental  question  is  whether  it  is  advantageous 
to  route  over  many  short  hops  (, short-hop  routing  or,  in  the 
extreme  case,  nearest-neighbor  routing)  or  over  a  smaller 
number  of  longer  hops  ( long-hop  routing).  Recently,  this 
debate  extended  to  multihop  extensions  of  WLANs  [1]  and 
multihop  cellular  networks  [2] .  Short-hop  routing  gained  a  lot 
of  support,  and  its  proponents  mainly  produce  the  following 
two  arguments: 

1.  Energy  consumption.  If  a  long  hop  of  distance  d  is  divided 
into  n  hops  of  distance  d/n,  the  energy  benefit  is  often 
assumed  to  be  na_1,  where  a  is  the  path  loss  exponent. 

2.  Capacity.  The  shorter  the  hops,  the  higher  the  transport 
capacity  in  an  interference-limited  network  [3]. 

The  first  argument  stems  from  an  oversimplified  analysis  of 
the  energy  consumption  and  neglects  important  issues  such  as 
delay,  end-to-end  reliability,  and  bias  power  consumption.  The 
second  argument  is  only  valid  as  long  as  the  connectivity  of 
the  network  is  guaranteed;  it  was  derived  for  an  increasingly 
dense  network  that  takes  advantage  of  the  singularity  of  the 
attenuation  d~a  at  d  =  0,  which  may  lead  to  the  unrealis¬ 
tic  situation  that  the  received  power  exceeds  the  transmitter 
power;  and  it  neglects  delay,  too.  In  this  paper,  we  shed  more 
light  on  these  issues  by  listing  twelve  reasons  why  short-hop 
routing  is  not  as  beneficial  as  it  seems  to  be.  Some  of  the 
reasons  have  been  mentioned  in  other  work,  but  this  is,  to  the 


best  of  our  knowledge,  the  first  comprehensive  collection. 

Often,  a  disk  model 1  is  used  for  the  analysis  of  wireless 
networks,  where  a  transmission  is  either  100%  successful 
or  fails  completely,  depending  on  whether  the  distance  is 
smaller  or  larger  than  the  so-called  transmission  radius.  More 
realistic  is  the  threshold  model 2,  where  a  certain  signal-to- 
noise-and-interference  ratio  (SINR)  is  needed  for  successful 
transmission.  Still,  for  AWGN  channels,  the  threshold  model 
yields  0%  or  100%  probability  and  should  therefore  be  used 
with  great  care.  To  get  accurate  results,  reception  probabilities 
should  be  based  on  bit,  block,  and  packet  errors  rates,  taking 
into  account  the  error  correction  capabilities  of  the  channel 
code. 

We  will  demonstrate  that  by  discarding  the  disk  model  and 
directly  focusing  on  SINR  levels,  many  advantages  of  long- 
hop  routing  become  apparent. 

II.  Network  and  Link  Model 

Part  of  our  discussion  applies  to  many  types  and  classes  of 
networks  and  wireless  channels.  However,  to  be  concrete,  we 
often  focus  on  networks  with  random  node  distribution  and 
Rayleigh  fading  channels. 

A.  Node  Distribution  and  Generic  Routing 

The  analytical  results  are  derived  for  networks  whose  nodes 
constitute  a  Poisson  point  process  in  the  plane.  Note  that  for 
infinite  networks,  the  Poisson  point  process  corresponds  to  a 
uniform  distribution  [4],  [5],  and  for  large  networks,  the  two 
distributions  are  equivalent  for  all  practical  purposes. 

Many  different  routing  algorithms  exist  for  ad  hoc  networks 
[6],  [7],  but  common  to  all  of  them  is  the  fact  that  at  each  hop, 
progress  shall  be  made  towards  the  destination.  This  generic 
routing  strategy  is  illustrated  in  Fig.  1.  If  the  nearest  neighbor 
within  a  certain  sector  of  the  source-destination  axis  is  chosen 
as  the  next  relay,  this  is  certainly  an  instance  of  short-hop 
routing.  If  many  nearby  neighbors  are  skipped  and  a  node 

^lso  called  protocol  model  [3]. 

2 Also  denoted  as  physical  model  [3]. 
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Fig.  1.  Part  of  a  Rayleigh  network  with  the  source  at  the  origin  and  the 
x-axis  pointing  towards  the  destination  node.  R  denotes  the  distance  to  the 
nearest  neighbor  with  in  a  sector  </>  around  x,  and  ijj  is  its  argument.  Hence 
(. R ,  ip)  are  the  polar  coordinates  of  the  nearest  neighbor  within  a  sector  <p, 
and  ( X ,  Y)  are  its  Cartesian  coordinates. 


transmits  directly  to  a  more  distant  neighbor,  we  speak  of 
long -hop  routing. 

Note  that  the  distance  to  the  n-th  nearest  neighbor  that  lies 
within  the  desired  sector  can  easily  be  determined  from  the 
Poisson  assumption.  The  probability  density  of  the  distance  to 
the  n-th  neighbor  in  a  sector  0  is  [8] 


PRn(r)=r2n~1 


2 

(n-1)! 


e~r24>/2 


(1) 


Since  pru  is  a  Rayleigh  distribution  for  n  =  1,  it  can 
be  considered  a  generalized  Rayleigh  distribution.  Similarly, 
in  one  dimension,  the  Erlang  distribution  is  a  generalized 
exponential  distribution.  So,  the  transition  from  one  dimension 
to  two  dimensions  simply  entails  a  multiplication  by  r<f  (that 
comes  from  the  inner  derivative  of  the  exponential  part)  in  the 
distributions  of  the  node  distances. 


B.  Link  Model 


Although  some  of  the  reasons  listed  in  the  next  section  also 
pertain  to  the  disk  model,  we  will  mostly  use  a  Rayleigh  block 
fading  channel  model.  Assuming  that  a  certain  SINR  level 
0  is  required  for  successful  reception  (threshold  model),  the 
reception  probability  of  successful  packet  reception  can  be 
expressed  as  [8] 


Pi 


-exp(-w)-.ni 


(2) 


where  N  is  the  noise  power,  Pq  the  transmit  power  of  the 
desired  transmitter,  Pi,  i  >  0  the  transmit  powers  of  the  k 
interferers  (who  transmit  in  the  same  time  slot),  do  the  distance 
from  the  receiver  to  the  desired  transmitter,  and  di,  i  >  0,  the 


distance  to  the  interfering  transmitters,  a  is  the  large-scale  path 
loss  exponent.  In  the  expression  (2),  we  can  identify  two  parts: 

,  which  is  the  reception  probability  in  an  interference-free 
network  since  it  only  depends  on  the  noise,  and  pTr,  which  is 
the  reception  probability  in  a  noise-free  network  since  it  only 
depends  on  the  interference.  This  property  of  the  Rayleigh 
fading  channel  enables  the  separate  analysis  of  noise  and 
interference  effects. 

III.  The  Twelve  Reasons 

Clearly,  the  most  compelling  reason  against  short-hop  rout¬ 
ing  is  the  end-to-end  delay.  However,  we  do  not  consider 
delay  in  itself  an  argument  for  long-hop  routing,  since  energy 
and  delay  can  be  traded  off  against  each  other.  So,  for  a 
fair  comparison,  both  schemes  shall  satisfy  the  same  energy 
and  delay  requirements.  We  may  fix  the  energy  consumption 
and  search  for  the  protocol  with  the  smallest  delay,  or  we 
may  impose  a  delay  constraint  and  determine  which  protocol 
consumes  the  least  amount  of  energy. 

A.  Interference 

According  to  [9],  “It  is  unclear  whether  more  interference 
is  caused  by  a  single  transmission  at  higher  power  or  multiple 
transmissions  at  lower  power”.  Indeed,  a  shorter  transmission 
at  higher  power  may  permit  more  efficient  reuse  of  the 
communication  channel.  If  the  total  radiated  energy  (product 
of  power  and  duration)  of  is  a  good  indicator  for  interference, 
this  boils  down  to  an  energy  consumption  problem.  However, 
it  must  not  be  forgotten  that  the  SIR  does  not  depend  on 
absolute  power  levels.  If  all  nodes  scale  their  power  by  the 
same  factor  q  >  1,  all  the  SIR  levels  remain  constant,  but 
the  SINR  levels  will  increase,  which  can  be  easily  seen  from 
(2).  So,  increasing  all  transmit  power  levels  does  not  have 
a  negative  impact  on  any  packet  reception  probability  in  the 
network  (on  the  contrary),  in  stark  contrast  to  what  is  predicted 
by  the  disk  model.  This  indicates  that  long-hop  transmission 
does  not  inherently  cause  more  interference. 

B.  End-to-end  Reliability 

Under  the  disk  model,  reception  probabilities  are  either 
100%  or  0%.  If  every  receiver  is  in  its  desired  transmitter’s 
disk,  the  end-to-end  reliability  is  always  100%,  which  is 
clearly  not  realistic,  since  packet  errors  or  bit  errors  accu¬ 
mulate.  In  the  Rayleigh  block  fading  case,  the  end-to-end 
reception  probability  pn  over  an  n-hop  route  follows  from 
(2)! 

i>"=ne"e/,‘=e"eEf-,t  <3> 

i= 1 

where  7 i  denotes  the  mean  SNR  at  link  i.  So,  to  achieve  a 
desired  end-to-end  reliability  with  short-hop  routing,  the  relay 
nodes  need  to  transmit  at  a  higher  power.  This  compensates, 
at  least  partially,  for  the  loss  in  SNR. 


100 


Consider  the  case  of  a  one-dimensional  chain  of  equidis¬ 
tant  nodes  with  distance  d  (this  is  the  optimum  scenario 
for  short-hop  routing).  Let  Eo  be  the  energy  required  for 
a  transmission  over  distance  d  with  probability  Pee ,  i-e., 
Eo  :=  —daQN/  In  Pee-  Covering  the  total  distance  in  one 
single  hop  requires  an  energy  of  E\  =  naEo.  In  the  multihop 
case  with  n  hops,  a  reception  probability  pr  =  \JPee  is 
required  at  each  hop.  Since  \rPee  =  n  lnpr,  the  total  energy 
in  this  case  is  En  =  n •  n  Eo.  So,  for  a  =  2,  there  is  no  benefit 
in  multihop  routing. 

If  the  same  end-to-end  delay  is  permitted,  a  number  of  trans¬ 
mission  (at  lower  power)  are  possible  in  the  long-hop  case.  For 
the  block  fading  channel,  this  results  in  a  time  diversity  benefit, 
and  the  energy  advantage  of  short-hop  routing  vanishes  also 
for  higher  path  loss  exponents,  in  particular  for  high  end-to- 
end  reliabilities  or  when  channel  state  information  is  available 
at  the  transmitter. 

C.  Capacity  and  Channel  Coding 

For  optimum  coding  in  AWGN  channels,  a  change  from  a 
nominal  capacity  Co  :=  log2(l  +  Po/N)/2  to  Co/q  results  in 
an  energy  consumption  of  E(q)  =  qP(q)  =  qN(22C°/q  —  1), 
which  is  strictly  monotonically  decreasing  in  q?  Note  that 
Co/q  =:  R  is  the  information  theoretic  rate  (bits/symbol),  and 
that  the  gain  from  using  longer  codes  is  higher  for  higher  rates. 

Assume  that  a  (normalized)  distance  1  is  covered  by  an 
n-hop  route  for  some  small  n ,  say  1  ^  n  ^  5.  In  this  case, 
spatial  reuse  is  not  possible,  so  a  simple  TDMA  MAC  scheme 
will  perform  optimally.  Since  there  is  no  interference  in  this 
case,  the  analysis  is  based  on  AWGN  channels.  Let  R  denote 
the  bandwidth-normalized  end-to-end  rate  for  the  n-hop  route. 
We  find 

B=siofe(i+^”")'  (4> 

where  Es  =  E^R  is  the  energy  per  symbol  and  No  is  the 
noise  spectral  density.  By  dividing  the  distance  into  hops  of 
length  1/n,  the  SNR  increases  by  na.  On  the  other  hand,  the 
end-to-end  rate  is  reduced  by  a  factor  of  n.  Conversely,  the 
n-hop  scheme  needs  to  transmit  at  a  per-hop  rate  of  nR  to 
achieve  the  same  end-to-end  rate.  The  question  is  which  is 
the  optimum  n  given  a  certain  desired  end-to-end  rate.  This 
problem  is  addressed  in  some  detail  in  [10].  Here  we  note 
that  since  the  rate  loss  in  (4)  is  linear  while  the  gain  from  the 
increased  SNR  is  only  proportional  to  log2(n),  there  exists  a 
certain  rate  Rn  at  which  the  bit-energy-to-noise  ratio  E^/Nq 
is  the  same  for  n-hop  and  n  A  1-hop  routing.  Above  Rn ,  n-hop 
routing  performs  better,  and  below  Rn ,  (n  +  l)-hop  routing  is 
better.  Focusing  on  the  case  of  single-hop  and  two-hop  routing, 
we  find  from  (4)  that  R\  =  log2(2a  —  1)  <  a.  So,  as  a  simple 

3  The  increase  in  transmission  length  is  only  linear  in  q,  while  the  power 
can  be  reduced  exponentially. 


rule  of  thumb,  we  can  say  that  whenever  the  desired  rate  is 
higher  than  the  path  loss  exponent,  single-hop  routing  achieves 
the  highest  capacity. 

More  generally,  it  can  be  shown  that  for  R  — >  0,  there  exists 
an  asymptotic  per-hop  spectral  efficiency 


S  :=  lim  Rnopt{R)  (5) 

R — >0 

that  only  depends  on  a.  It  is  given  by 

_  W(— ae~a)  +  a  <  a(  1  -  e~a) 

In  2  “  In  2  ’  1  ’ 

where  W(-)  denotes  the  (principal  branch  of  the)  Lambert  W 
function  and  the  bound  stems  from  the  first  order  Taylor  ex¬ 
pansion.  The  primary  use  of  this  asymptotic  spectral  efficiency 
is  to  determine  the  optimum  hop  number:  It  can  be  shown  that 
with  good  accuracy 


novi(a,R)^[S{a)/R\,  (7) 


where  [x\  denotes  the  nearest  positive  integer  to  x.  Using  the 
upper  bound  (6),  we  find 


^opt (pi  R)  ^ 


"a(l  -e~a) 
R  In  2 


(8) 


which  conveniently  yields  the  optimum  number  of  hops  for 
any  R  and  a. 

If  channel  coding  is  taken  into  account,  multihop  is  further 
penalized  due  to  the  necessary  encoding  and  decoding  at  each 
hop. 


D.  Total  Energy  Consumption 

It  is  often  assumed  that  a  reduction  of  the  transmit  energy 
yields  a  proportional  reduction  of  the  total  energy  consump¬ 
tion.  Even  without  taking  into  account  receive  energy,  this 
is  not  true  for  any  practical  power  amplifier.  In  particular  in 
low-power  transceivers,  the  local  oscillators  and  bias  circuitry 
will  dominate,  so  that  short-hop  routing  does  not  yield  any 
energy  benefit  if  a  more  distant  relay  node  can  be  reached  with 
sufficient  reliability  [11].  For  random  networks,  relatively  high 
peak  power  levels  are  necessary  to  keep  the  network  connected 
[12],  and  short-hop  routing  would  require  a  substantial  backoff 
on  the  average,  resulting  in  poor  power  efficiency. 


E.  Path  Efficiency  in  Random  Networks 

Routes  in  random  networks  cannot  follow  straight  lines.  The 
path  efficiency,  defined  as  the  ratio  of  Euclidean  distance  of 
the  end  nodes  and  the  travelled  distance,  is  higher  if  longer 
hops  are  used.  Consider  the  generic  routing  strategy  in  Fig.  1. 


For  nearest-neighbor  routing  in  a 

sector  f  the  expected  path 

efficiency  r]  for  a  long  connection 

is  E[cos\P],  so 
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where  the  approximation  is  the  second-order  Taylor  expan¬ 
sion.  Now  assume  that  this  nearest-neighbor  path  has  been 
established,  and  that,  instead  of  routing  through  every  node, 
a  long-hop  strategy  is  applied  where  only  every  n-th  node  is 
used  as  a  relay.  With  increasing  n,  the  argument  4/  from  the 
origin  to  the  n-th  node  tends  to  be  Gaussian  distributed  with 
variance  V(4 ),  n).  Since  the  support  of  the  pdf  of  4/  is  always 
[—0/2,  0/2], 4  the  variance  decreases  inversely  proportional  to 
n,  i.e.,  F(0,  n)  ~  So,  for  large  n,  we  get5 


T]n  (0)  =  E[cOS^] 
\/6n 

~  J- 

_Al 
=  e  24 


f  cos  0  e  6n(  t )  d0 
J  —  oo 


1  - 


24n 


(ID 

(12) 

(13) 

(14) 


which  shows  that  the  path  efficiency  grows  monotonically  with 
n  and  goes  to  1  for  large  n.  The  energy  penalty  caused  by 
deviations  from  the  optimum  path  is 


1 

Vn((p)a 


1  + 


acp2 
24  n 


(15) 


This  shows  that  the  energy  penalty  can  be  made  independent 
of  a ,  if  only  every  n  =  a-th  node  is  used. 


F.  Sleep  Modi  or  Cooperation 

If  neighboring  nodes  are  not  used  as  relays,  they  can 
either  be  put  to  sleep,  or  they  can  assist  the  transmission 
by  cooperation  [13],  [14]  or  retransmission  ( e.g .,  if  an  ACK 
packet  is  not  received  by  the  source). 


G.  Routing  Overhead  and  Route  Maintenance 

In  [9],  it  is  pointed  out  that  (when  we  replace  a  larger 
number  of  short  hops  by  a  smaller  number  of  long  hops) 
“It  is  far  from  clear  what  happens  to  the  overall  transmission 
energy,  since  to  implement  a  nearest-neighbor  policy,  signifi¬ 
cantly  augmented  overhead  control  traffic  will  be  required  to 
coordinate  the  establishment  of  the  routing  paths  and  access 
control  protocols  across  the  entire  network.” 

In  a  first  order  approximation,  the  control  traffic  for  routing 
and  route  maintenance  is  proportional  to  the  number  of  nodes 
in  the  route.  Also,  the  probability  of  a  route  break  due  to 
energy  depletion  and  node  failure  clearly  increases  with  the 
number  of  nodes  involved,  as  well  as  the  memory  requirements 
for  the  routing  tables. 

4Hence,  after  every  convolution,  the  support  needs  to  be  scaled,  which 
results  in  a  reduction  of  the  variance. 

5  The  Gaussian  approximation  is  very  accurate  even  for  small  n.  For  </>  = 
7t/2  and  n  =  1,  e.g.,  the  precise  value  is  2y/2/7r  ~  0.9003,  whereas  this 
approximation  yields  e-7r  /96  «  0.9023,  so  the  error  is  only  0.2%.  The 
second  order  Taylor  expansions  are  identical,  even  for  n  =  1. 


Fig.  2.  The  critical  area  of  a  sensor  network  is  determined  by  the  nodes  that 
can  reach  the  base  station  in  a  single  hop.  If  it  consists  of  only  a  few  nearest 
neighbors  of  the  base  station,  these  nodes  will  severely  limit  the  lifetime  of 
the  network  since  they  have  to  relay  the  whole  traffi  c. 


H.  Route  Longevity  in  Mobile  Environments 

The  SNR  of  short-hop  routes  is  more  quickly  affected  by 
moving  nodes.  For  example,  if  a  node  at  distance  1  moves 
away  by  1  unit,  the  SNR  change  is  2a,  which  causes  the  link 
to  break  (unless  an  unreasonably  high  SNR  margin  is  applied). 
On  the  other  hand,  if  a  relay  node  is  3  units  away  moves  by 
the  same  distance,  the  SNR  change  is  only  (4/3) a,  which  can 
probably  be  tolerated. 

In  general,  the  lifetime  of  a  link  is  proportional  to  its  (orig¬ 
inal)  length.  To  show  this,  assume  that  routes  are  established 
with  a  certain  SNR  margin  M@,  such  that  for  a  link  of  initial 
distance  d,  we  have  =  0  M©.  At  the  critical  distance 
Ad  when  the  link  breaks,  we  have  (d+£d)aN  =  Solving 
for  Ad,  we  find 

Ad  =  d(M&/a  -  1) .  (16) 

For  a  given  mobility  pattern,  Ad  determines  the  lifetime  of  a 
link,  so  indeed  longer  links  live  proportionally  longer. 

I.  Traffic  Accumulation  and  Energy  Balancing 

For  certain  multihop  networks  such  as  sensor  networks  or 
multihop  cellular  networks,  traffic  accumulation  around  a  base 
station  (BS)  or  access  point  is  a  big  problem.  With  strict  short- 
hop  routing,  the  relaying  burden  cannot  be  distributed  among 
a  high  enough  number  of  nodes,  leading  to  a  critical  area 
around  the  BS  whose  nodes  suffer  from  a  short  lifetime  (see 
Fig.  2)  The  more  nodes  can  reach  the  BS  directly,  the  better 
distributed  the  load  can  be  [15]. 

J.  Variance  in  Hop  Length 

In  random  networks,  due  to  the  variance  in  hop  length, 
the  variance  in  energy  consumption  is  large  when  nearest- 
neighbor  schemes  are  used,  causing  substantial  imbalance 
in  the  energy  consumption.  Assuming  that  a  power  control 
scheme  is  used  that  adapts  the  transmit  power  to  maintain 
a  certain  reception  probability  p ^  (see  (2)).  For  an  n-hop 
route,  we  are  interested  in  the  power  consumption  at  the  node 
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that  has  to  transmit  over  the  largest  distance,  since  this  node 
determines  the  lifetime  of  the  route.  From  (1)  we  know  that 
the  distance  to  the  nearest  neighbor  is  Rayleigh  distributed. 
So,  for  a  path  loss  exponent  a  =  2,  the  power  level  is 
given  by  an  exponential  distribution,  and  we  need  to  determine 
the  expected  maximum  of  n  exponentially  distributed  random 
variables.  The  same  problem  appears  in  selection  combining 
of  multiple  transmission  in  fading  channels,  so  we  can  apply 
the  result  from  [16,  p.3 16]  that  this  maximum  is  given  by  the 
harmonic  sum,  i.  e. , 

M 

E[max{i?i, . . . ,  Rm}2}  =  ^  ^  ^  ln(M)  +  7em  ,  (17) 

fc= 1 

where  yem  «  0.577  is  the  Euler-Mascheroni  constant.  The 
bound  ln(M)  +  7em  is  tight  already  for  moderate  M,  so 
the  expected  maximum  E[max{i?i, . . . ,  Rm}2}  is  sharply 
lowerbounded by  E [R2]  (In  M+ 7em).  For  a  ^  2,  with  Jensen’s 
inequality,  we  have  E[(X2)a/2]  >  E[X2]a/2  and  thus 

E[ma x{Ru  . . . ,  RM}a]  >  (E[max{i?i, . . . ,  i?M}2])“/2 

(18) 

>  (E[J?2]  (In  M  +  7em))  a/2  , 
or,  since  E [R2]  =  (4/7r)E[i7]2, 

(lnM  +  7em)“/2. 

(19) 

For  a  =  1  (the  distance  itself),  we  have  the  upper  bound 

E[max{i?i, . . .  ,Rm}]  ~  \/E[i?2](ln  M  +  7em)-  (20) 

This  shows  that  the  maximum  energy  consumption  over  M 
nearest-neighbor  hops  grows  with  at  least  In  M  in  networks 
with  random  node  distribtution.  If  longer  hops  are  permitted, 
the  distances  and  thus  the  power  consumption  can  be  better 
equalized  among  the  nodes  in  a  route. 

K.  Bounded  Attenuation 

As  pointed  out  in  the  introduction,  a  path  loss  model  with 
a  singularity  at  distance  d  m  0  is  not  realistic  for  networks 
with  high  density.  Clearly,  there  is  a  bound  on  the  received 
power.  If  we  assume  that  this  bound  is  achieved  for  distances 
0  <  d  <  R,  then  there  is  no  benefit  of  using  shorter  hops  than 
R ,  since  the  interference  remains  constant  and  the  received 
power  does  not  increase  if  the  distance  is  decreased  further. 
This  problem  is  particularly  relevant  in  dense  networks  [17]. 

L.  Multicast  Advantage 

So  far,  we  have  only  addressed  unicast  routing.  For  multi¬ 
cast,  other  tradeoffs  between  short-hop  and  long-hop  routing 
exist.  In  particular,  as  discussed  in  [9],  it  is  often  advantageous 
for  a  source  to  transmit  at  high  power  levels  to  reach  a 
maximum  number  of  nodes  in  the  multicast  group. 


E[max{i2i, . . . ,  Rm }a] 


> 


=E[i?] 


IV.  Conclusion 

We  have  listed  twelve  compelling  reasons  why  the  tradeoff 
between  routing  over  many  short  hops  and  routing  over  fewer 
longer  hops  is  not  as  clear  as  it  is  often  assumed.  Not  all 
reasons  apply  to  all  types  of  networks,  of  course,  but  several 
of  them  will  be  relevant  for  most  networks.  The  conclusion  is 
that  routing  as  far  as  possible  is  a  very  competitive  strategy  in 
many  cases.  Conversely,  from  a  design  perspective,  the  peak 
transmit  power  should  be  chosen  such  that  a  node  can  reach 
well  beyond  nearest  neighbors. 
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Analysis  and  Design  of  Diversity  Schemes  for 
Ad  Hoc  Wireless  Networks 

Martin  Haenggi,  Senior  Member,  IEEE 


Abstract — Diversity  schemes  permit  efficient  communication 
over  fading  channels  but  are  often  hard  to  analyze  and  design  in 
networks  with  many  nodes.  For  Rayleigh-fading  channels,  there 
exists  an  interesting  relationship  between  resistive  circuits  and 
time  and  path  diversity  mechanisms  in  wireless  ad  hoc  networks. 
A  resistor-like  network  element,  the  erristor ,  representing  the  nor¬ 
malized  noise-to-signal  ratio,  is  introduced.  Given  an  end-to-end 
packet  delivery  probability,  the  logarithmic  mapping  from  link 
reception  probabilities  to  erristor  values  greatly  simplifies  the 
problems  of  power  allocation  and  the  selection  of  time  and  path 
diversity  schemes,  which  is  illustrated  in  a  number  of  examples. 
We  focus  on  transmission  strategies  with  selection  combining  and 
simple  noncoherent  “decode-and-forward”  strategies,  which  is 
motivated  by  their  practicality.  Thanks  to  its  conceptual  simplicity, 
the  formalism  that  is  developed  provides  valuable  insight  into  the 
benefits  of  diversity  mechanisms. 

Index  Terms — Ad  hoc  networks,  diversity  methods,  Rayleigh 
channels. 


I.  Introduction 

N  AD  HOC  wireless  networks,  energy  and  interference 
considerations  often  necessitate  multihop  routing ,  where 
nodes  also  act  as  routers,  forwarding  other  nodes’  packets  [1], 
[2].  Routing  schemes  that  were  developed  for  wired  networks 
will  perform  suboptimally  since  they  are  based  on  virtually 
error- free  point-to-point  links,  thereby  ignoring  two  funda¬ 
mental  properties  of  the  wireless  link:  1)  the  fragility  of  the 
channel  due  to  fading  and  interference  [2],  [3]  and  2)  the 
inherent  broadcast  property  of  wireless  transmissions.1  Fading 
can  be  an  advantage  if  transmissions  can  be  scheduled  oppor¬ 
tunistically  [4],  [5],  and  the  broadcast  property  can  be  exploited 
by  transmission  schemes  that  are  based  on  the  principle  of 
cooperative  diversity  [6],  where  nodes  coordinate  both  direct 
and  relayed  transmissions.  Cooperative  diversity  is  a  form  of 
spatial  diversity,  which,  in  the  case  of  static  single- antenna 
nodes,  reduces  to  path  diversity.  The  other  promising  strategy 
(in  the  case  of  narrowband  channels)  against  fading  is  time 
diversity,  which,  for  slow  (nonergodic)  fading  channels  and 
relatively  short  packets,  is  mainly  exploited  in  the  form  of 
retransmissions. 

For  the  analysis  of  multihop  packets  networks,  a  determin¬ 
istic  “disk  model”  is  often  used  [7]— [15],  where  the  radius  for 
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xWe  assume  that  omnidirectional  antennas  are  employed. 


a  successful  transmission  has  a  power-dependent  deterministic 
value,  irrespective  of  the  condition  of  the  wireless  channel. 
Interference  is  commonly  taken  into  account  using  the  same 
geometric  disk  abstraction.  The  stochastic  nature  of  the  fading 
channel  and,  thus,  the  fact  that  the  signal-to-noise-and-in- 
terference  ratio  (SINR)  is  a  random  variable  are  neglected. 
In  general,  the  volatility  of  the  channel  cannot  be  ignored 
in  wireless  networks  [2],  [3],  and  a  rigorous  assessment  of 
diversity  benefits  is  not  possible  with  a  deterministic  model. 
The  inaccuracy  of  the  “prevalent  all-or-nothing  model,”  as 
the  disk  model  is  called  in  [16],  has  also  been  pointed  out  in 
[17]  and  is  easily  demonstrated  experimentally  [18],  [19].  To 
overcome  some  of  these  limitations  of  the  disk  model,  we  em¬ 
ploy  a  Rayleigh-fading  link  model  that  relates  transmit  power, 
large-scale  path  loss,  and  the  success  of  a  transmission. 

Based  on  this  model,  we  present  a  simple  yet  powerful  for¬ 
malism  that  allows  an  efficient  analysis  and  design  of  time  and 
path  diversity  strategies  with  selection  combining  for  Rayleigh- 
fading  channels.  In  the  analysis  problem,  the  transmit  power 
levels  are  given  and  the  end-to-end  reliability  pee  is  to  be  de¬ 
termined,  whereas  in  the  (more  interesting)  design  problem, 
we  assume  that  the  application  dictates  a  certain  end-to-end 
reliability  p^,  and  the  question  is  how  to  choose  the  transmit 
powers,  the  relays  (paths),  and  the  number  of  transmissions  over 
each  link  in  order  to  minimize  energy  consumption  and/or  max¬ 
imize  network  lifetime  under  the  constraint  pee  >  Pd-  We 
focus  on  noncoherent  “decode-and-forward”  strategies,  which 
are,  in  contrast  to  coherent  combining,  straightforward  to  imple¬ 
ment.  Nonetheless,  they  exhibit  greatly  superior  performance  to 
schemes  without  diversity,  in  particular  at  high  SNR. 

The  rest  of  this  paper  is  organized  as  follows.  In  Section  II, 
the  link  model  is  presented.  Section  III  introduces  the  “erristor” 
framework,  where  transmissions  are  characterized  by  a  resistor¬ 
like  circuit  element  whose  value  equals  the  normalized  noise-to- 
signal  ratio  at  the  receiver,  and  discusses  its  use  for  multihop 
routes  and  time  diversity  schemes.  Section  IV  focuses  on  path 
diversity  schemes  and  presents  several  examples,  and  Section  V 
concludes  the  paper. 

II.  Link  Model 

We  assume  a  narrowband  multipath  channel,  modeled  as 
a  Rayleigh  block  fading  channel  [20]  with  an  additive  white 
Gaussian  noise  (AWGN)  process  z.  The  received  signal  at  time 
k  is  yk  =  CLk%k  +  where  a,k  is  the  large-scale  path  loss 
multiplied  by  the  fading  coefficient.  The  variance  of  the  noise 
process  is  denoted  by  Aq. 

The  transmission  from  node  i  to  node  j  is  successful  if  the 
SINR  7  is  above  a  certain  threshold  ©  that  is  determined  by 
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the  communication  hardware  and  the  modulation  and  coding 
scheme  [2]. 

With  the  assumptions  above,  the  received  power  Q  is  expo¬ 
nentially  distributed  with  mean  Q  =  Pod~a,  where  Pq  is  pro¬ 
portional  to  the  transmit  power,2  d  —  \\xi  —  Xj\\2  the  distance 
between  transmitter  and  receiver,  and  a  the  large-scale  path  loss 
exponent.  I  is  the  interference  power  affecting  the  transmis¬ 
sion,  i.e.,  the  sum  of  the  received  powers  of  all  the  undesired 
transmitters. 

The  following  theorem  shows  that  for  Rayleigh-fading  net¬ 
works,  it  is  possible  to  analyze  noise  and  interference  separately. 

Theorem  1:  In  a  Rayleigh-fading  network,  the  reception 
probability  pr  =  P[ 7  >  ©]  can  be  factorized  into  the  reception 
probability  of  a  zero-interference  network  p^  and  the  reception 
probability  of  a  zero-noise  network  pTr  as  follows: 


traffic  (pi  — >  1).  For  the  heavy  traffic  case,  a  separate  interfer¬ 
ence  analysis  would  have  to  be  carried  out,  as  in  [22]  and  [23], 
and  the  resultant  pTr  would  have  to  be  multiplied  by  the  p^f  de¬ 
rived  here. 

In  a  zero-interference  network,  the  reception  probability  over 
a  link  of  distance  d  at  a  transmit  power  Pq,  is  given  by 

&N0 

pr  :=  P[7iv  >  0]  =  e  por“  .  (2) 

Solving  for  Pq,  we  get  for  the  necessary  transmit  power  to 
achieve  pr 


daGN0 

—  In  pr  * 


(3) 


Note  that  for  a  — ►  00,  the  fading  model  turns  into  a  disk  model. 
For  d  <  1,  the  reception  probability  is  one,  for  d  >  1,  it 
is  zero.  Similarly,  for  the  interference  term  in  (1),  if  all  the 
interferers  are  further  away  than  the  desired  transmitter,  i.e., 
dQ  <  min{<ii, . . . ,  dk},  reception  is  guaranteed.  Otherwise,  re¬ 
ception  is  impossible. 


where  dQ  is  the  distance  of  the  desired  transmitter,  di  ,1  <  i  <  k 
are  the  distances  of  the  k  interferers,  and  Pj  are  their  transmit 
power  levels. 

Proof:  Let  Qq  denote  the  received  power  from  the  desired 
source  and  Qi,  i  =  1, . . . ,  k,  the  received  power  from  k  in¬ 
terferers.  All  the  received  powers  are  exponentially  distributed, 
i.e.,  pq.  (qi)  =  1  /Qie~ri^i ,  where  Qi  denotes  the  average  re¬ 
ceived  power  Qi  =  Pid~a.  Conditioned  on  I,  the  probability 
of  correct  reception  is  simply  an  exponential,  so  we  have 


Pr  =  E 1 

=  E  j 


P[Oo>B(/  +  7V0)|7] 

©(/  +  Nq) 


exp 


Q  0 


k 

x  II  PoMi)  Mi  ■  ■  ■  Mk 

i=  1 


III.  Erristor  Representation 

A.  Connections  Without  Retransmission 

Over  an  n-hop  connection  from  node  0  to  node  n  in  an  ad  hoc 
network,  the  reception  probability  (end-to-end  reliability)  is 

Vn  J- 

Pee  =  e  £&= 1  n  (4) 

where  7 |  denotes  the  mean  SNR  at  receiver  i.  Let  R  denote  the 
normalized  average  noise-to- signal  ratio  (NSR)  at  the  receiver, 
i.e.,  R  :=  6/7.  We  get 


n 

—  In  pee  =  ^  Ri  =  Rt.o$~  (3) 

i=  1 

With  pd  the  desired  end-to-end  reliability,  the  condition  pee  > 
Pd  translates  into  the  condition  that  the  sum  or  the  series  con¬ 
nection 4  of  the  NSR  values  Ri  is  at  most  R&  :=  —  lnpo-  So, 
the  individual  Ri  ’  s  can  be  replaced  by  an  equivalent  Rtot .  For 
a  single  link,  we  have 


from  which  (1)  follows.3  ■ 

This  allows  an  independent  analysis  of  the  effect  caused  by 
noise  and  the  effect  caused  by  interference.  Since  we  are  con¬ 
cerned  with  the  fundamental  energy  benefits  of  diversity,  the 
focus  of  this  paper  is  the  noise,  i.e.,  on  the  first  factor  in  (1). 
Clearly,  pi  is  invariant  under  (global)  power  scaling,  so  it  does 
not  give  an  indication  at  which  power  levels  to  transmit.  The  re¬ 
sults  that  are  derived  are  directly  applicable  to  the  case  of  light 


2This  equation  does  not  hold  for  very  small  distances.  So,  a  more  accurate 
model  would  be  Q  =  Pq  •  (d/ d0)~a,  valid  for  d  >  d0,  with  Pq  as  the  average 
value  at  the  reference  point  d0 ,  which  should  be  in  the  far  field  of  the  transmit 
antenna.  At  916  MHz,  for  example,  the  near  field  may  extend  up  to  3-4  ft  (sev¬ 
eral  wavelengths). 


R  =  —  Inpr  4=^  pr  =  e  R.  (6) 

For  probabilities  close  to  1  (or  R  <C  1),  the  following  first-order 
approximations  are  accurate 

R  :=  1  —  pr  ~  R  pr  :=  1  —  R  ~  pr.  (7) 

This  approximation  shows  that  for  small  values,  the  NSR  can 
be  considered  equivalent  to  the  packet  loss  probability.  To  em¬ 
phasize  this  fact  and  the  resistor-like  series  connection  property 
of  the  NSR,  we  denote  R  as  an  “erristor”  and  its  value  as  its 
“erristance.”  It  follows  from  (5)  that  over  a  multihop  connection, 
the  noise  accumulates  and  the  packet  loss  probabilities  simply 
add  up. 


3 A  similar  calculation  has  been  carried  out  in  [21,  Appendix]  for  a  network  4In  terms  of  the  SNR  values,  this  corresponds  to  a  parallel  connection,  which 
with  spreading  gain  and  equal  transmit  powers  for  all  nodes.  *  '“''was  pointed  out  in  [24]. 
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Fig.  2.  (Left)  A  two-hop  connection  with  two  transmissions  over  the  second 
hop.  (Right)  Corresponding  erristor  circuit. 


Fig.  1.  (a)  Three-hop  connection  with  link  reception  probabilities  pi7p2, 

and  p3  and  the  erristor  circuit,  (b)  The  erristor  values  are  the  normalized 
noise-to-signal  ratios  RllR2,  and  R3.  Ri  =  —  In pi. 


The  relationship  between  the  transmit  power  and  P  is 


P0  =  dajN0  = 


daGN0 

R 


(8) 


Henceforth,  P  :=  da/R  denotes  the  normalized  (by  ©TVq) 
transmit  power. 

Example  1:  Fig.  1(a)  shows  an  example  with  three  links  and 
their  reception  probabilities.  From  (5),  we  know  that  a  series  of 
hops  translates  into  a  series  connection  of  erristors,  hence,  we 
find  the  corresponding  erristor  network  in  Fig.  1(b).  For  p&  = 
90%,  for  example,  the  value  of  Pi  +  R2  +  P3  must  be  at  most 
Rd  =  —  InpD  ~  0.105.  If  all  the  power  levels  are  equal,  this 
can  be  achieved  by  setting  Pi  =  P2  =  P3  =  0.105/3  =  0.035. 
A  possible  solution  with  unequal  power  is  Ri  =  R2  =  0.05 
and  P3  =  0.005.  In  this  case,  the  probability  after  two  links  is 
e-0.05e-0.05  po  90.5%,  which  is  already  close  to  90%.  Con¬ 
sequently,  a  high  amount  of  energy  is  consumed  at  the  third 

link  to  ensure  packet  reception  with  the  required  probability 
g— 0.005  =  99  5% 

If  the  internode  distances  di  (between  node  i  —  1  and  node  i) 
are  given,  a  solution  can  be  determined  that  ensures  that  all  the 
transmit  power  levels  have  the  same  value  P.  From  (d({  4f  d%  + 
d%)/P  =  —  lnpD>  we  get 


P  = 


df  +  d%  +  d% 
-In  Pd 


(9) 


For  di  —  ifot  =  2,  and  pd  =  90%,  for  example,  we  get  P  «  14- 
9.5  =  133  and  Pi  «  0.0075,  P2  «  0.03,  and  P3  «  0.0677.  □ 


B.  Connections  With  Time  Diversity  ( Retransmissions ) 

In  general,  with  n  transmissions  over  the  same  link  at  NSR 
levels  Ri  and  selection  combining,  the  total  reception  proba¬ 
bility  is 


Pn  =  1  "IF1  -fr/?  )•  (10) 

7-1 


So,  erristors  connected  in  series  behave  like  regular  resistors, 
whereas  the  values  of  erristors  connected  in  parallel  (retrans¬ 
missions)  have  to  be  multiplied.  Due  to  the  bound  (11),  the  re¬ 
sultant  end-to-end  reliability  will  be  slightly  higher  than  the  one 
specified. 

Example  2:  Consider  the  simple  example  in  Fig.  2.  A  packet 
is  transmitted  with  reception  probability  poi  over  the  first  hop 
and  transmitted  twice  over  the  second  hop,  with  probabilities 
Pi2,i  and  pi2,2,  respectively.  The  end-to-end  reliability  of  the 
connection  is  pEE  =  p(n  •  (1  -  (1  -  Pi2,i)(l  -  £12,2))-  Let 
pD  =  90%.  How  to  allocate  the  transmit  powers? 

For  the  second  link,  we  have  pi2  =  1  —  (1  —  P12/1  )(1  — 
Pi2,2)  =  1  —  (1  —  e_i?12’1)(l  —  e_i?12’2).  From  Theorem 
2,  e_i?12  li?12’2  is  a  lower  bound  for  p12,  and  for  JJj  <  1 
and  P2  <C  1,  the  bound  is  tight.  Thus,  we  may  replace  Pi2?i 
and  Pi2?2  by  P2  =  Pi2,iPi2,2-  In  the  erristor  diagram, 
the  two  transmissions  are  illustrated  by  a  parallel  connec¬ 
tion  (see  Fig.  2).  From  Pi  +  P2  <  Rp  =  0.1,  a  possible 
choice  is  Pi  =  P2  =  0.05  and  Pi2?i  =  Pi2?2  =  a/5/10. 
Thanks  to  the  retransmission,  node  2  consumes  less  than 
half  the  energy  of  node  1,  assuming  the  distances  are  the 
same.  An  allocation  where  both  nodes  use  the  same  power  is 
Pi  =  a/110/5  —  2  «  0.098,  and  Pi2?i  =  Pi2,2  =  2Pi.  This 
way,  node  1  benefits  from  node  2’s  retransmission  and  uses 
almost  only  half  the  power.  □ 

For  n  transmissions  with  the  same  power  level  P,  the  dif¬ 
ference  between  the  precise  probability  value  1  —  (1  —  e~R)n 
and  the  lower  bound  e~Rn  is  plotted  in  Fig.  3.  The  erristance 
threshold  where  the  bound  is  within  1%  is  P  =  0.236  for  n  =  2 
and  P  =  0.375  for  n  =  4.  Thus,  for  P  <  1/4 (p  >  78%),  the 
bound  is  sufficiently  tight  for  all  practical  purposes.  For  values 
P  >  1,  the  bound  is  loose,  and  the  multiplication  would  not 
make  sense,  since  the  overall  erristance  increases,  although,  of 
course,  even  a  retransmission  with  low  power  still  leads  to  an 
improvement  in  the  total  link  reception  probability.  However, 
for  P  ~  1/2,  a  single  transmission  outperforms  splitting  the 
power  into  two  transmissions.  For  two  transmissions  at  NSR 
2 P,  the  reception  probability  is  pr  =  1  —  (1  —  e~2R )2,  whereas 
for  a  single  transmission  at  NSR  P,  we  get  p'r  =  e~R.  The  two 
probabilities  are  equal  for 


To  derive  a  general  rule  for  the  simplification  of  these  expres¬ 
sions,  we  apply  the  following  theorem. 

Theorem  2:  For  (.7*1,  c2, . . . E  (^o~)n 

So,  for  P  ~  1/2,  the  reception  probability  is  higher  for  a 
single  transmission  at  NSR  P.  From  a  practical  point  of  view,  re¬ 
ception  probabilities  lower  than  37%  may  not  be  worth  spending 
energy  at  the  receiver.  By  excluding  erristances  greater  than  one, 
with  equality  if  and  only  if  n/=1  xi  —  0.  this  is  taken  into  account,  and,  at  the  same  time,  the  validity  of 

The  proof  is  presented  in  the  Appendix.  10&  e  multiplication  property  is  preserved. 


(11) 

7-1 


R  =  In  2  -  In  (A  -  l)  «  0.48.  (12) 
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Erristance 

Fig.  3.  Difference  between  the  exact  probability  and  the  lower  bound  for  n  = 
2,4,6,8,10  transmissions  with  equal  erristance  R. 


(b) 


Fig.  4.  (a)  Network  that  exploits  path  diversity  and  (b)  its  erristor  circuit. 

IV.  Path  Diversity 
A.  Transmissions  Over  Independent  Paths 

With  the  addition  and  multiplication  property  of  the  erristors, 
we  have  all  the  ingredients  to  analyze  and  design  path  diversity 
schemes. 

Example  3:  Fig.  4  displays  an  example  of  a  network,  where 
path  independence  is  guaranteed  even  when  the  channels  have  a 
long  coherence  time.  By  conventional  analysis,  the  end-to-end 
reception  probability  is 

Pee  =  (1  -  (1  -  e~Roie~R^)(l  -  e~R°2)) 

•  (1  -  (1  -  e~R23e~R34)(l  -  e~R24)).  (13) 

How  to  choose  the  i^-’s  to  guarantee  pee  >  pd  =  95% ?  The 
equivalent  erristance  [Fig.  4(b)]  is  Rto t  =  (Roi  +  ^12)^02  + 
(R23  +  ^34)^24,  and  pee  =  e~Rtot.  For  pD  =  95%,  we  have 
Rtot  «  0.05.  Thanks  to  the  symmetry,  (Rqi  +  ^12)^02  = 
(R23  +  -#34)^24  =  0.025  is  a  solution;  hence,  we  may  set 
nearest-neighbor  hops  to  Ri-i^i  =  0.05  and  R02  =  R24  = 
0.25.  Note  that  the  value  for  the  two  longer  hops  is  five  times 
bigger,  which  means  that  the  necessary  transmit  powers  are 
comparable  if  the  nodes  have  equal  distances  and  the  path  loss 
exponent  is  between  2  and  3.  So,  the  diversity  scheme  and  the 
power  allocation  guarantee  pee  =  e-0  05  «  95.1%  >  po- 


Fig.  5.  Erristor  circuit  for  example  3  (see  Fig.  4)  including  implicit 
transmissions.  Implicit  erristors  are  shaded  gray. 


reallocation  of  resources,  if  necessary.  Assume  node  3  runs  out 
of  energy.  With  R^  — >  00,  we  see  immediately  that  R23  be¬ 
comes  useless,  and  the  only  path  in  the  lower  half  of  the  dia¬ 
gram  will  be  the  one  with  R2 4.  What  value  does  R24  need  to 
have  to  ensure  pd?  Without  changing  the  other  erristances,  we 
find  R24  =  0.025.  □ 

The  total  normalized  energy  consumption  (per  packet)  at  each 
node  can  easily  be  determined 


v — ^  tZ  • 


Rij 
3= 1 


where  m  is  the  number  of  outgoing  paths  from  node  i. 


B.  Implicit  Transmissions 

In  example  2  (Fig.  2),  if  node  2  listens  to  the  transmission 
from  node  0  to  node  1,  then  this  implicit  transmission  has  to 
be  modeled  by  an  additional  erristor  for  an  accurate  analysis. 
This  implicit  erristor  is  free  in  terms  of  transmit  power  (but  still 
requires  power  to  receive  the  packet);  it  represents  the  benefit 
from  the  wireless  medium,  often  denoted  as  the  wireless  multi¬ 
cast  advantage  [2],  [25]. 

Example  2  ( Continued ):  Assume  pd  =  99%,  so  Rtot  « 
0.01.  This  is  achieved  by  setting  R0 1  =  0.005  and  R\2,i  = 
^12,2  =  0.07.  However,  since  there  is  an  implicit  transmis¬ 
sion  from  0  to  2,  there  is  an  erristor  in  parallel  with  a  value 
of  Rq2  =  ^01(^02/^01)^  (the  superscript  i  indicates  an  im¬ 
plicit  transmission).  Assuming  do 2  =  2doi  and  a  =  3,  we  get 
R02  =  0.04,  jRtot  =  0-01  •  0-04,  and  pee  ~  99.96%,  which  is 
much  better  than  the  target  of  99%.  So  we  can  reduce  R01  to  a 
value  that  guarantees  (R01  +  0.005)i?oi  •  8  =  0.01.  Solving  the 
resulting  quadratic  equation  yields  Roi  «  1/30,  which  corre¬ 
sponds  to  less  than  1/6  of  the  original  power.  □ 

For  large  path  loss  exponents  and/or  small  transmit  power 
levels,  the  benefit  to  listeners  that  are  farther  away  than  the  in¬ 
tended  receiver  becomes  small,  since  the  implicit  erristances 
will  be  close  to  one  or  even  above.  However,  if  the  implicit  re¬ 
ceiver  is  closer  than  the  intended  one  or  if  the  transmit  power 
is  relatively  high,  it  is  worthwhile  having  the  nodes  awake  and 
listening. 

Example  3  ( Continued ):  In  example  3  (Fig.  4),  there  is  an 
implicit  transmission  from  node  2  to  node  3  when  node  2  is 
transmitting  to  node  4.  If  node  3  ignores  this  transmission,  then 
the  previous  analysis  was  correct.  If  it  takes  advantage  of  that 
information,  we  have  to  add  another  erristor  R\^  as  shown  in 

Fig.  5.  Analogously,  Rx01  models  the  implicit  transmission  01 
that  occurs  due  to  the  explicit  transmission  02 . 


The  conventional  analysis  (13)  yields  pee  ~  95.8%  which  is,  Assuming  equal  distances  between  neighboring  nodes, 
as  expected,  slightly  larger.  The  formalism  also  permits  a  rapid  =  ^02  and  R\ 3  =  2~aR24.  For  a  =  2  and  using  the 
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Roi 

(b) 


Fig.  6.  (a)  Simple  cooperative  scheme  and  (b)  the  corresponding  erristor 

circuit.  Node  C  is  relaying  a  packet  from  node  0  to  assist  node  0.  The  distances 
0 C  and  Cl  are  d  =  1/  cos  + 


same  values  as  before,  =  0.05,  and  Rq2  =  R24  =  0.25, 

we  find  Rqi  =  R\ 3  =  0.0625  and 


Rtot  -  2  (  20  •  16  +  20 


resulting  in  pee  ~  97.3%,  which  is  larger  than  the  target  of 
95%.  Considering  that  1/16  1,  we  may  try  to  omit  the 

explicit  transmission  01  completely,  which  results  in  Rtot  = 
(1/16  + 1/20)/2  «  0.056  and  pee  =  94.4%.  A  slight  decrease 
of  R02  and  R2 4  by  10%  each  brings  pee  to  95.2%.  □ 

The  next  example  demonstrates  how  the  erristor  formalism 
transforms  complex  logarithmic  relationships  into  simple  poly¬ 
nomial  ones,  which  are  analytically  tractable. 

Example  4.  A  Simple  Cooperative  Scheme:  In  Fig.  6,  a  situ¬ 
ation  is  shown  where  node  0  wants  to  transmit  to  node  1,  and 
the  cooperative  node  C  may  help  as  a  relay.  From  the  erristor 
circuit,  it  can  be  seen  that  there  is  no  explicit  transmission  from 
0  to  C.  The  goal  is  to  determine  which  transmission  strategy 
minimizes  the  total  transmit  energy  Et ot  given  a  certain  total 
erristance  Rto t- 

Let  D  :=  (d/2)a.  With  i?,tot  =  Roi(Rloc+Rci)  and  Rloc  = 
DRqi,  we  get  i?tot  =  An  (An  D+Rci )  and  Etot  =  2a/i?oi  + 
da/RCi  =  2“(i+  +  DRcD- 
Strategy  A — Equal  Received  Power.  A  possible  strategy  is  to 
have  C  transmit  at  a  power  level  that  makes  the  received  power 
at  node  1  equal  to  that  from  the  direct  transmission  01,  i.e., 
Rci  —  Roi  =:  R.  Rtot  simplifies  to  Rtot  =  R2(l  +  D)  and, 
thus,  R  =  v  {Rtot) /(I  +  D).  Inserting  this  expression  into  the 
transmit  energy  Etot  =  (2a)/(R)(l  +  D )  yields 


Kt  = 


41  +  -P)3/2 

\J Aot 


Strategy  B — Equal  Transmit  Power.  Here,  we  assume  that 


both  node  0  and  C  use  the  same  transmit  power.  With  R  := 
Roi  and  Rci  =  RqC  =  RD ,  we  have  Rtot  =  2 DR2  and"! 


0  75-2  1 


Fig.  7.  Visualization  of  the  regions  in  the  (D,  J )  plane  where  the  different 
strategies  are  optimum.  Note  that  the  curves  D/S  and  (1  +  D)3  intersect  at  the 
points  (\/5  —  2,  (+5  —  l)3)  and  (1,8). 


R  =  (i?tot)/(2i9).  The  total  energy  consumption  is  simply 

Etot  =  2  •  2a/R,  or,  as  a  function  of  Rtot 

^t=2“+l/S-  (i7) 

The  energy  consumption  ratio  of  strategies  A  and  B  is 

,  .  £tBot  2V7P 
P  '  E&  t  (l  +  D)3/2- 

p  =  1  for  D  =  1  and D  =  y/h-2  «  0.236.  For  ^5-2  <  i9  < 
1,  strategy  A  is  preferable  (p  >  1).  The  maximum  p,  however, 
is  only  (4/9)\/6  «  1.089,  occurring  at  D  —  1/2.  So,  strategy 
A  is  at  most  8.9%  better. 


To  get  a  complete  view,  we  also  discuss  the  case  of  a  direct 
one-hop  transmission  and  an  explicit  two-hop  scheme  without 
a  direct  path  from  0  to  1 .  For  the  one-hop  case,  we  have  E™te  = 
2a/Rtot ,  and  for  the  two-hop  case  (assuming  equal  transmit 
powers),  there  are  two  erristors  in  series  with  value  i?tot/2  and, 
thus,  Et™  =  4 da/Rtot  =  4 DEZe- 

The  one-hop  strategy  is  better  if  4 D  >  1,  or,  in  terms  of  the 
actual  distance  d,  d  >  21_2/+  So,  for  a  =  2  and  for  d  >  2, 
one-hop  is  always  better,  even  for  a  •-+  00,  which  is  easily  ex¬ 
plained,  since  node  C  is  then  not  closer  than  node  1 .  As  a  func¬ 
tion  of  the  angle  </>  =  arccos(l/<i),  the  condition  for  one-hop 
to  be  better  is  expressed  as  </>  >  arccos(22/a_1).  For  a  — ►  00, 
the  critical  angle  is  </>  =  7r/3  (corresponding  to  an  equilateral 
triangle  0C1),  as  expected. 

The  last  step  is  the  comparison  of  these  simple  schemes  with 
the  cooperative  strategies  A  and  B.  First,  we  note  that  B  always 
outperforms  the  two-hop  scheme,  since  it  exploits  “free”  infor¬ 
mation  that  is  transmitted  over  the  direct  path.  The  tournament 
between  A  and  one-hop  is  won  by  A  if  <  (1  +  D)~3,  and 
B  wins  against  one-hop  if  Rtot  <  (S/D).  The  resulting  divi¬ 
sion  of  the  (D,  jR^) -plane  in  the  different  strategies  is  shown 
in  Fig.  7.  Since  is  related  to  the  SNR,  Fig.  7  suggests  that 
cooperation  is  beneficial  at  high  SNR  and  relatively  small  dis¬ 
tances  D,  in  agreement  with  intuition.  □ 

Example  5.  Virtual  Antenna  Arrays:  Several  nodes  that  are 
close  may  cooperate  and  act  as  a  virtual  antenna  array,  ex¬ 
ploiting  spatial  diversity.  The  performance  of  such  arrays  was 
mialyzed  in  [26]  from  an  information-theoretic  perspective. 
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Fig.  8.  (a)  Virtual  antenna  scheme  and  (b)  its  erristor  circuit. 

Here,  not  assuming  any  form  of  lower-layer  coordination  within 
the  cluster,  we  apply  the  erristor  formalism  to  compare  these 
schemes  with  conventional  multihop  routing.  Fig.  8  shows  the 
erristor  diagram  of  a  simple  scenario  with  two  nodes  assisting 
each  other  at  the  source,  in  the  middle,  and  at  the  destination. 
So,  instead  of  individual  nodes,  we  have  clusters  of  two  nodes 
at  positions  0,  1,  and  2.  It  is  assumed  that  the  intracluster 
distances  are  much  smaller  than  the  intercluster  distances  d/2. 
When  the  source  node  in  cluster  0  is  transmitting  to  cluster  1 , 
his  peer  will  receive  that  packet  with  probability  (almost)  one 
since  Rq0  Roi-  In  the  next  time  slot,  this  peer  node  will 
transmit  the  same  packet  to  cluster  1 .  Hence,  the  same  packet 
is  delivered  over  four  different  paths.  Similarly,  cluster  1  relays 
the  packet  to  cluster  2  over  four  paths.  In  the  case  that  the  actual 
destination  node  itself  in  cluster  2  does  not  correctly  receive  the 
packet,  an  additional  short  intracluster  transmission  is  required, 
whose  energy  is  neglected  in  the  following  analysis. 

We  assume  Roi  =  R\2  •=  R •  For  the  diversity  scheme,  with 
Rq0  <  R ,  we  get  Rtot  =  2 R4  and 


o 

Cl 


(b) 

Fig.  9.  (a)  Region  in  the  (a,pD)  plane  where  the  diversity  scheme 

outperforms  conventional  multihop  routing,  (b)  Energy  gain  as  a  function  of 
the  end-to-end  probability. 


This  diversity  scheme  can  be  generalized  to  clusters  of  size 
m  that  transmit  over  n  hops.  In  this  case,  Rtot  =  nR m  and 


Etnt  —  4 


W  2)« 


R  V2 J  \RtotJ 

For  comparison,  for  a  four-hop  connection  with  hops  of  length 
d/ 4,  we  have  Rtot  =  4 R'  and 


The  ratio  between  the  two  energies  is 

Etot  nr* — x  T-) 


=  2^-1  R* 


Hence,  the  diversity  scheme  is  more  efficient  for 

-4a+7  /  -4a+7 

Rtot  <2s  or  pD  >  exp  -2  s 


For  the  multihop  scheme  with  mn  hops,5  Rtot  =  mnR'  and 


Etot  =  mn 


The  ratio  is 


'  nm 2  1rna 


This  curve  is  plotted  in  Fig.  9(a).  Substantial  energy  gains  are 
possible  for  high  pD  [see  Fig.  9(b)].  When  the  path  loss  expo¬ 
nent  increases  by  one,  the  energy  gain  decreases  by  3  dB.  ^ 


from  which  we  see  that  the  energy  gain  is  maximized  for  m  =  2 
(except  for  a  =  2,  where  m  =  3  performs  slightly  better) 
and  increases  almost  linearly  in  n.  We  conclude  that  for  high 
Pd  and  a  not  too  big,  the  diversity  scheme  clearly  outperforms 
conventional  multihop  routing.  □ 

,q  5This  comparison  is  fair  both  in  terms  of  the  number  of  nodes  involved  and 
t3i  the  delay,  since  the  total  number  of  transmissions  is  m  n  for  both  schemes. 
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Fig.  10.  (Top)  Six  nodes  of  a  two-level  leap-frog  scheme.  Every  nodes  listens 
to  the  transmissions  of  the  two  previous  nodes.  (Bottom)  A  conservative 
simplified  erristor  circuit. 

Example  6.  Long  Line  Networks  with  Equidistant  Nodes: 
Consider  a  long  line  network  with  equidistant  nodes  of  distance 
d—  1,  with  the  source  at  the  left  end  and  the  destination  at  the 
right  end.  Every  node  transmits  once  but  listens  to  the  trans¬ 
missions  of  m  of  its  left  neighbors,  resulting  in  m  —  1  implicit 
transmissions.  The  corresponding  erristor  circuit  for  m  =  2 
is  shown  in  Fig.  10.  Due  to  the  long  chain  of  dependencies, 
such  a  leap-frog  scheme6  is  difficult  to  analyze;  we,  therefore, 
simplify  it  by  attaching  the  implicit  transmission  to  node  n  +  1 
rather  than  node  n  (Fig.  10).  This  yields  an  upper  bound  on 
the  erristance,  since  the  accumulated  erristance  at  node  n  is 
certainly  smaller  than  at  node  n  +  1.  The  per-hop  erristance  R 
is  then 


Fig.  11.  Diversity  gain  from  listening  to  m  transmissions  as  a  function  of  the 
single-transmission  erristance  R1  for  a  —  2,  3, 4,  5 .  m  —  min{5,  } . 

The  mean  of  R  is  the  ath  moment  of  an  exponential  random 
variable  (RV),  so  we  have  E[P]  =  T(a  + 1)  or,  if  a  is  an  integer, 
E[R\  =  a\.  For  P/l,  this  is  to  be  divided  by  P. 

For  a  two-neighbor  cooperative  scheme  (m  =  2),  the  ex¬ 
pected  per-hop  erristance  is  P2  =  E[Df  (Di  +  D2)a],  and  in 
general,  with  transmit  power  P,  we  have 


R  =  Pi  •  2aR1  •  3aR1  •  . . .  •  maR1  =  R^(m\)a.  (26) 

The  choice  m  =  \_R~[1^a\  ensures  that  nodes  do  not  listen  to 
transmissions  with  low  reliability.  The  benefit  compared  with 
conventional  multihop  routing  is 

^  =  R\~m  •  ( m\)~a  (27) 

which  is  enormous,  in  particular  at  high  SNR  (low  Pi),  as  it  is 
illustrated  in  Fig.  1 1  for  Pi  =  [10-3, 0].  As  2"Pi  is  larger  than 
one,  the  benefit  predicted  from  the  erristor  framework  vanishes, 
whereas  a  more  accurate  analysis  based  on  actual  probabilities 
would  still  yield  a  gain.  If  the  receive  energy  is  taking  into  ac¬ 
count,  however,  it  may  not  be  worthwhile  to  spend  energy  on 
listening  to  remote  transmitters,  since  the  incremental  gain  may 
be  comparatively  small.  Therefore,  we  only  consider  transmit¬ 
ters  with  erristances  smaller  than  one,  and  we  limit  m  to  5.  □ 
Finally,  to  demonstrate  the  applicability  of  the  erristor  con¬ 
cept  to  random  networks,  let  us  consider  line  networks  with  uni¬ 
formly  random  node  distribution. 

Example  7.  Random  Line  Networks:  For  long  line  networks 
with  uniformly  random  distribution,  the  internode  distances  D k 
are  exponentially  distributed  and  independent  identically  dis¬ 
tributed  (i.i.d.),  and  without  loss  of  generality,  we  assume  unit 
mean.  For  unit  transmit  power  (P  =  1),  the  erristance  equals 
and  its  pdf  is 

(_j_  \  fa 

~r«  — .  (28) 

/  ar 

A 

6This  name  is  borrowed  from  analog  RLC  filter  design. 


("^  *  \  cx 

3  \ 

•  (29) 

k=l  J 

Since  Dk  is  i.i.d.,  this  can  be  simplified  to  sums  and  products  of 
higher  moments  of  Dk ,  which  were  determined  above.  Fig.  12 
shows  the  gain  Pi  / Rm  for  m  =  2  and  rn  =  3.  The  gain  in¬ 
creases  with  Pm_1,  and  it  is  seen  that  the  higher  the  path  loss 
exponent,  the  higher  the  SNR  has  to  be  in  order  to  get  a  benefit. 

The  minimum  P  necessary  to  see  a  benefit  can  be  derived  ana¬ 
lytically.  Foret  —  m  —  2,  e.g.,  we  find  (E[D4]  +  2E[D3]E[F>]-b 
E[D2]2)/P2  <  E[D2]/P,  or  40/P2  <  2/P,  which  holds  for 
P  >  20,  in  accordance  with  Fig.  12(a).  □ 

V.  Concluding  Remarks 

The  erristor  formalism  permits  the  mapping  of  unwieldy 
probability  expressions  into  a  simple  circuit-like  framework, 
which  greatly  simplifies  the  analysis  and  design  of  transmit 
schemes  that  are  based  on  selection  combining  with  time 
diversity,  path  diversity,  or  a  combination  thereof.  The  wireless 
multicast  advantage  is  incorporated  by  implicit  erristors.  The 
erristor  circuit  is  topologically  equivalent  to  the  network  graph 
and  can,  therefore,  be  drawn  in  a  straightforward  manner. 
Resource  (re)allocation  problems  can  effortlessly  be  solved 
by  simple  arithmetic,  which  makes  a  real-time  implementa¬ 
tion  possible.  A  series  of  examples  demonstrated  not  only 
the  capabilities  of  this  analysis  and  design  tool,  but  also  the 
huge  gain  that  is  possible  from  diversity-based  communication 
strategies,  in  particular  when  high  reliabilities  are  required. 
With  more  complex  combining  techniques,  such  as  maximum 
fktio  combining,  this  gain  could  be  improved  even  further. 
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(a) 

Diversity  m=3 


Fig.  12.  Diversity  gain  for  random  networks  as  a  function  of  the  normalized 
transmit  power  P  for  a  —  2,  3, 4,  5.  (a)  m  —  2 .  (b)  m  —  3 . 


parallel  erristors  does  not  have  a  counterpart  in  electric  circuits. 
Further,  the  erristor  diagram  is  directed ,  it  requires  information 
about  the  direction  of  the  transmission,  i.e.,  who  is  transmitting 
and  who  is  receiving.  To  make  this  clear,  an  arrow  could  be 
used  as  in  SFGs  to  indicate  direction.  This  would  also  add 
timing  information  to  the  diagram,  since  it  would  indicate  a 
natural  sequence  of  the  transmissions. 

Compared  with  SFGs,  the  way  series  and  parallel  branches 
are  treated  is  different.  In  an  SFG,  transmittances  of  serially 
connected  branches  are  multiplied  and  parallel  branches  result 
in  addition,  whereas  in  the  erristor  diagram,  serial  branches  are 
added  and  parallel  branches  are  multiplied.  If  instead  of  NSRs, 
SNRs  were  used,  the  multiplication  of  serial  branches  would 
hold  for  both  graphs,  but  k  parallel  branches  would  have  to 
be  treated  like  parallel  resistors,  yielding  the  harmonic  mean 
divided  by  k.  Further,  the  topology  of  the  SNR-based  circuit 
would  no  longer  be  identical  to  the  original  ad  hoc  network. 

Appendix 

Proof  of  Theorem  2 
Proof:  In  the  inequality  [see  (11)] 

n 

1  -  IT1  -  e~Xi )  - e_  ^”=r  Xi  (3°) 

i= 1 

it  is  easy  to  see  that  equality  holds  (both  sides  are  equal  to  1)  if 
one  of  the  x%  is  zero.  It  remains  to  show  that  /n(  • )  :  Un  — >  U 

n 

fn{x  i,x2,...,xn)  =  1  -  JJ(1  -  e~Xi)  -  e-IUi*  (31) 

i=  1 

is  positive  if  all  X{  are  positive.  We  note  that  /  goes  to  zero  if 
Vi,  Xi  — ►  oo.  Hence,  /  is  positive  for  positive  Xi  if  its  partial 
derivatives  df  jdxi  are  positive  at  zero  and  have  at  most  one 
zero  for  positive  X{.  Since  the  function  is  symmetric  in  all  X{ ,  it 
is  sufficient  to  consider  only  one  partial  derivative.  An  inductive 
technique  is  employed,  discussing  the  case  n  —  2  first. 

Consider 

92 (X)  :=  h (X,  y )  =  e~x  +  e"*  -  e~*^  -  e~xy  (32) 


Further,  the  formalism  is  useful  for  educational  purposes, 
since  the  multiplication  property  of  parallel  erristors  impres¬ 
sively  demonstrates  the  benefits  of  diversity  schemes,  and  the 
series  connection  shows  how  the  noise  and,  in  turn,  the  packet 
loss  probability  accumulate  over  multiple  hops. 


for  a  fixed  y  >  0.  For  y  —  0,  <72(2 0  =  0,  and  for  y  >  0,  we  note 
that  g( 0)  =  0  and  lim^^oo  g(x)  —  e~y  >  0.  Since 

g'(x)  =  V  =  e~x(-l  +  e~v  +  yex^-yh  (33) 
dx 


The  simple  multinomial  expressions  for  end-to-end  erris- 
tances  also  lend  themselves  to  a  sensitivity  analysis,  i.e.,  the 
determination  of  the  sensitivity  of  the  total  erristance  to  the 
individual  erristances  dRtot/dRi.  This  permits  the  allocation 
of  power  where  it  has  the  most  impact. 


there  exists  a  single  local  extremum  x  for  y  >  0  at 

1  .  (  y 


X  = 


y- 1 


In 


1  -  e-y 


(34) 


Erristor  circuits  are  relatives  of  both  electronic  circuits  and  As  y  >  (1  —  e~y )  for  y  >  0,  we  find  x  >  0  for  y  >  1  and 

signal  flow  graphs  (SFGs),  but  some  important  differences  ir<0for0<2/<l.  For  y  —  1,  no  solution  exists,  and  for 

should  be  pointed  out.  The  analogy  to  electronic  resistive  v  =  0 ,g'(x)  =  0.  Since  ^(O)  =  —  1  +  e~y  +  y  >  0,  it  is 

circuits  only  holds  for  serial  elements;  the  multiplication  o^  ^  clear  that  the  extremum  is  a  maximum.  So,  we  have  g(0)  =  0, 
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and  for  0  <  y  <  1  ,g(x)  is  monotonically  increasing,  whereas 
for  y  >  1,  it  is  monotonically  increasing  up  to  x,  and  then 
monotonically  decreasing  to  e~y  >  0.  Hence,  g(x)  >  0  for 
x  >  0,  and  we  have  proven  the  theorem  for  n  =  2. 

Now,  assuming  it  is  true  for  n  —  1,  we  prove  the  induction 
step. 

For  the  general  function  gn(x )  :=  f(x,x with 
fixed  xi  >  0  for  i  >  1,  we  note  that  #n(0)  =  0  and 


n 

lim  =  1  -  TT(1  -  e~Xi)  >  0.  (35) 

X^OO  4-  4- 

v-2 


With  Q  :=  n?=2  xi>  we  get 


Ml -Q) 


(36) 


\  *= 2 

Evaluation  at  x  =  0  yields 

n 

m  =  Q-l\n-^) 

i= 2 

n 

>l-e~Q  ~Y[(l-e~Xi) 

i= 2 

fn—1  (*^2  5  *^3^  •  •  •  5  *^n) 

>  0 


(37) 


where  we  have  made  use  of  the  induction.  Again,  equality  holds 
for  Q  =  0  only.  Solving  #«<>)  =  0  yields  the  single  extremum 


x  = 


1 


o-i 


In 


(nr=2(iQ-e-*o)‘ 


(38) 


We  already  established  in  (37)  that  the  numerator  is  greater  than 
(or  equal  to)  the  denominator  in  the  logarithm,  so,  analogously 
to  (34),  we  find  that  ^<0for0<Q<l  and  x  >  0  for  Q  >  1. 
Again,  no  solution  for  Q  =  1  and  g'(x)  =  0  for  Q  =  0.  ■ 
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We  intend  to  use  this  approach  to  develop  an  embedded  O/S  kernel  that  can 
automatically  upgrade  itself  and  reconfigure  to  changes  in  external  world 
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Use  of  hybrid  system  methodologies  allow  us  to  explicitly  account  for  network’s 
tight  interaction  with  the  external  world 
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Development  Plan 

-  Initial  development  on  QNX  network 

-  Port  to  TinyOS 
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Other  model  problems  are  being  considered 
-  atmospheric  control,  robotic  colonies, 
pursuit  evader  problems 
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Models  should  be  provided  by  application  designer. 
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Real-time  Configuration  of  Networked  Embedded  Systems 
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Subcontractors  and  Collaborators 
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performance. 


PROJECT  TASK  SUMMARY 
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Model  Problem  (SANET)  implemented  with  Berkeley  OEP 
Development  of  Time-Sync  TinyOS  components 
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-  all  task  milestones  achieved  for  first  meeting 

-  no  delayed  milestones 
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PROJECT  PLANS 
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Time  Synchronization  Task  (Lemmon) 
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SUCCESS  METRIC: 

Development  of  T-sync  components  that  allow  synchronous  SANET 
operation  consistent  with  coordination  frequencies  of  10-100  Hz. 


Time  Synchronization  Status  and  Plans 
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TinyOS  component 

Model  Problem  Implementation 
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Model  problem  s  closed  loop  perlormance  witmn  specs. 

Successfully  predict  NEST  middleware  QoS  (delay/dropout)  for  model  problem 
Successfully  enforce  NEST  middleware  QoS  for  model  problem 
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Adaptive  Reconfiguration  -  Synthesis  Task  (Antsaklis) 
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SUCCESS  METRIC: 

-  Successful  generation  of  supervisors  for  NEST  Model  Problem. 


Link  Reconfiguration  Task  (Haenggi) 
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EXAMPLE  -  QoS  Requirements 
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EXAMPLE  -  Model  Problem 
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moveable  robotic  platforms 
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EXAMPLE  -  SANET  ANALYSIS 
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'Reduced  STP  Nel 


EXAMPLE  -  SANET  ANALYSIS 
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ALRM  ACK 


EXAMPLE  -  SUPERVISOR  SYNTHESIS 
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Success  Criteria 

•  Fast  and  optimal  reconfiguration  of  ad  hoc  network  connections 
on  model/challenge  problems. 
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Demonstration  of  TinyOS  NEST  services  on  model  problem. 
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-  connection  quality  vs.  closed  loop  performance. 


Streaming  Service  Demo  -  accomplishments 
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Streaming  Service  Demo  -  future  plans 
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problem  (robotic  formation  control). 

Analytical  validation  of  Empirical  Results 

-  PARTS  toolkit  analysis. 

-  Probabilistic  analyses. 


Miniature  Robotic  Platform 
accomplishments 


CD 

§-E 

o 

2  o) 

O)  C 

.2  ■§ 
CO  CO 
CD  > 
0 
0  J, 

5 '5 

c  S" 

5  a 

0  "O 
O  § 

2§ 

-Q  E 
o  E 

■S  .P 


0 


0 

O 

E 

< 

O 


o 

_Q 

O 


>>  m 

®  Q. 
0 

? 
0  CVJ 
CD  </> 

I 

E 
.o 


0 

E 

o 

-0 

O 

CD 

_c 

'c 

o 

o 

0 


0 


c 

o 


„  ■  -D 
LO  gj 

-g 

CM 


U  0 

0  Q_ 
0  SJ- 
0  o 

II 

.c?  o 
sz  0  *- 

£  S  8 

g  o  V 

O  C  ^ 
-  rS  O 

CD  J 


0 

s=  >>  £  .8 


LO 


0  LL 


CM 


O 
0 

00  c 
v  J  0 

i  0 

g  O 
.b!  0 
0  it 
=  LU 
0 

E 

CO 


0  i2 

_c  0 

c 
0  c 
>  0 

-O  O 

0  ^ 

0  g 

CD> 

12  CL 
-9  c 

-r  0 

-1-  -a 

-0  C 
0  0 
"S  Q- 

JP  q 
O  -D 
0  C 


_>*  0) 


U 
CD 
C 
CD 

'(/) 
CD 
~o 

i  9 


flj  ~  < 

•g  =o 
g-2^ 

oS 

I 


E 

o 


0 

0 

Q.  0 

■E  o 
—  0 
iS  c 

:s>  s 

So 

TD  0 

0  It 
0  CD 

O  = 
0  CO 

ss 

■^3  0 

8-0 
O  0 

II 

LL  Q 


I 


-D 

C 

0 

0 

o 


0 

0 

Q. 

CL 

0 

0 

i_ 

I  « 

§■§ 

CL  £ 

-  2 

d 

0  O 

-0  _cp 

c  0 

CD  s_ 

Q-  0 
0  > 

C  © 

-E  cl 


182 


-  Two  ADC  inputs  for  other  types  of  sensors 

Modified  UCB  Radio  Stack  frees  up  PWM 
and  input  capture  interrupts. 


The  Model  Problem  (future  plans) 
Large-Scale  Robotic  Formation  Control 
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[1]  H.  Yamaguch,  “Adaptive  formation  control  for  distributed  autonomous  mobile  robot  groups”,  IEEE  ICRA,  1997 


PARTS  -  accomplishments  and  future  plans 
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Probabilistic  Link  Analysis 
accomplishments 
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[1]  M.  Haenggi,  Performance  Analysis  of  Ad  Hoc  Wireless  Networks  based  on  a 
Probabilistic  Link  Model,  INFOCOM  2003  (submitted) 


Connection  Analysis 
accomplishments/future  plans 
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Transmit  Probability 


Control  Loop  Performance 
accomplishments/future  plans 
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-  Coordinated  movement  of  5-10  robots 

-  Low  average  tracking  error  and  high  coordination  frequency 


Project  Plans  and  Goals 
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Milestone:  Demo  of  NEST  services  on  UCB’s  OEP. 
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of  Networked  Embedded  Systems 
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Data  Streaming  in  Ad  Hoc  Wireless  Networks 
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Telemetry  provides  a  mechanism  for  monitoring  and  supervising 
the  entire  network. 
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-  Passive  services  simply  listen  for  messages. 

At  a  given  time  instant,  network  nodes  will  not  be  in  the  same 
state.  This  means  all  services  must  be  enabled. 


Notre  Dame’s  NEST  S/W  Architecture 
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Demonstration  Metric:  Majority  of 
Network  Nodes  Blink  their  LEDS  in 
Synchrony. 
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Packet  Fields  -  AUTH/SYNC  Demo 
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Fast  Re-routing  of  open  connection 


Packet  fields  -  Data  Streaming  Demo 
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Relation  to  Challenge  Problem 


< r > 


c 

0 

X 

p 

O 

O 

c 

0 

O 

1 _ ; 

0 

0 

i_ 

X 

1- 

0 

X 

|— 

CO 

LU 

0 

cb 

z 

0 

0 

0 

c 

0 

E 

1 

E 

O 

c 

'1 

< 

c 

C 

O 

© 

0 

c 

g 

H— » 

,0 

p 

0 

0 

O 

'c 

't. 

X 

0 

O 

L— » 
O 
0 
i— 

C 

X 

0 

o 

o 


cl  c o 
o5 

0  0 
x  3 


CD 


CO 

O)  =j 

.E  o 

^  c  I 

Q.  0  05 

E  CO  E 

CO  0) 

TD  O 

O  E 


E 

o 


CO 

E 

0 

CO 

CL 

0 

0)  £ 

O  CD 
~o 

i  § 

0)  ^ 


>»  © 

£  © 
0  - 

X  t 

-0  0 
0  CD 

0 
0 
0 
O 
O 
0  0 

CD 

-Q  ,> 


o 

0 

0 


0 


0  ^ 
o  0 

0 

4-*  ■  — 


0 


c  0 

0  CO 

O  c 

£  S 
I  c 

o  CD 

©  to  £ 

CD  JD  3 
LU  Q.  CD 


CD 


O 

O 


jS  00 

CD  CD 
C  -Q 

TD 

CD 
CD 


O 

8- 


TD 

0 

0 

C 


0 

0 


o 


c 

o 

E 

E 

o 

0 

0 

0 

■g 

> 

o 


0)  e 

.2  I 

>  -Q 

o  2 


0 

N 


> 

(/) 


0 

X 

>.  , 

g  C 

c  0 

CD  C 

C  O 

C/3  ° 
CO  i- 

2  £ 
■D 

TD  .E 
CD  O 
— ■  CL 

"8 

co  .E 
©  "C 
-9  CD 
Q_  © 

c 

CD  © 
_C  I© 

©  I 

-C  CL 

2  2 
0 


O  o 
0  -C= 

X  o 

-*-»  c 
c  ^ 


Q 

o  —  CO  £ 


o 

o' 


"O 

0 

0 

0 

X 

i 

-4— » 

c 

0 

+J 

c 

o 

o 

a 

2 

Q. 

0 


X 

© 

E 

0 

0 

■> 

0 

■ 

t 

E 

0 

L— 

0 

© 

X 

0 

(0 

0 

c 

0 

CL 

0 

© 

a> 

0 

.N 

> 

O 

c 

0 

O  0 
<0 


o 

0 


■a 

0 

0 

0 

E 


o 

a> 

0 

0 

> 

O 

o 

0 

X 

0 

o 

0 

X  CD 

I-  c 

"o  g 
0 

b  -Q 

,2  0 

£_  0 
CL  _Q 


212 


o 

E 

°  E 

©  © 

O  o 

i_ 

CL 


C 

o 


0 
CD 

0  c 

O  0 


0 

X 

o 

0 


<D 

O 

s 

© 

(0 

O) 


c 

o 

E 

E 

o 

o 

© 


©  £ 
.E  © 
>  © 

©  3 
©  © 

e>£ 

x  x 

CL  O 

co  I 

LLI  03 
Z  c 

©  E 

c  © 

©  2 
—  _i_i 

XI 

o 


0 
0 

®  § 

O)  -i— * 

C  CO 
_0  g 
'c 


0 

X 

o 

0 


E 

© 

© 

L. 

| 

(/) 


3 

E 

E 

o 


0 

o  :9 
‘  0 
0 
o 

Q. 

E 


o 

0 

L_ 

0 

a 

o 

c 

0 


9-'co 


0 
O 

.Q  > 
0 


-a 

o 

x 

0 

E 

0 

x 

0 

0 

o 

X 

0 

c 

o 

0 

0 

-g 

> 

o 

CL 

0 

i_ 

0 

X 

-a 

0 

c 

0 

0 

0 


0 

C 

C 

0 

E 


0  0 
0  .> 

X  O 
O  0 

BS 

=3  C 
O  0 


0 

O 


©  © 
o  © 

E  £* 

<8  E 

>,  - 
£  5 

©  © 

E  i— 

© 

.©  i 


^  E 

•  .2 

S-8 

s  - 

0 

0  CD 
C 

__  0 
°  0 

o  O 

>  © 
©  -c 

X  -*— * 

0  M- 

X  O 
0  0 
£  0 

8  S 

> 

©  8 

0  CD 
TD  CD 

C  £ 

2  03 

5  § 

o  1 

E  © 


TD 

0 

0  0 
X  =3 
!"  0 
X 


0 


C 

0 


C 

-Q  O 

-1— * 

0 


O 


0 


CL  ^ 

J  o  £ 

©  Q-  © 

CO  03  -g 

3  .E  2 
w  t  a 

”  iS  © 

0  D) 


0 
£  CO 

0  ^ 
0  12 

L.  > 
0 


C 

0 

0 

X 

o 

0 


o  ^  i= 


o 

0 

0 


CO 

0 

o 

■ 

© 

© 

h 

</) 

LU 


O 

■ 

+-> 

© 


O 

O 

o 


o 

i— 

X 

E  -E 
©  © 
■o  © 

©  .2 

r  © 

o  w 

*•-  CL 

■g  cl 

< 

u  O  "O 


■O 
© 

"m  w 

CD  CD 
c  CL 


©  = 
0 


8  ©  H 

^  CO 

©  LLI 

2z 

-S 

0  9- 
0 

E 

© 

i—  -I— » 
©  «  © 
I-  o  =6 

23  €  & 
z  &8 


0 
X 
-»— » 

0 
3 

E 

© 

©  co 
0  ^ 
>  O 


>  V:  © 


Service  Evaluation  Criteria 


0 

o 

'£ 
0 
c o 


0 

N 


0 

c 

o 

H— • 

CO 


o 

0 

0 

E 

Q 

OG 

J  1 
0  — 
.q  O 

0  g) 

o  ^ 

t  T= 

0  -Q 

C  <- 

o  ±= 

■  —  ■  ““ 

N  CO 


C 

o 

o 

c 

>> 


.N  0 

C 


c  if 

>  o 

0  ^ 

O  0 
O  c 

"3  *5 

0 

CD  C 
0  0 

0 

>  0 


i_ 

O 

£ 

-I— » 

0 

c 

_c 

0 

o 

i_ 

i_ 

0 

c 

o 

-i— > 
0 
N 

'c 

o 


P  o 


o  <  CL  CO 


> 

CO 


0 
-I— » 

0 

0  ^ 

o  o 

s  © 
£  CL 

0  "D 

CO  © 

2^  0 

c:  o 

I  £ 
0  ^ 
0  =5 

±3  CL 


I  I  I 


CO 

o 

'£ 

0 

C 

>> 

O 


0 

0 

s_ 

CD  Z5 
C  = 

£  ^ 

0 

©  E 

£  1 

o  ■© 

E  2 
0  -•-> 
C  CL 

>>  ^ 
■°  _Q 

£  ® 
~o  O 
0  0 


15 

CJ 


0 

> 

o 


D) 

o 


o 

c 

0 


0  0 
*“  0 

3  &■ 

0  0 


CD 

C 


0 
CD 
0 
0 
0 

0  0 
^  q: 


0 

o 


I  I  I 


213 


LU 

C 

o 

■ 

15 

N 


o 

"O 

0 

0 


0 

0 

CD 

E 

*i— > 
0 
0 
•i— > 
0 

£ 

o 


■O 

o 


0  Q 
0  0 
0  0 
0  E 
0 

O  O 
O  LO  0) 

Q-  <:  ig 

j=!  CD 

0 

_  0 
■•£=  O 


o 

CL 

o 


0  0  0 


3 

”  o 

CO 

■2  6  ro 
C/)  CM  LL 


0 

0 

. 

o 

CD 

C 

0 

■o 

o 

U 

s 

C? 

0 

o 

rH 

c 

c 

3 

6 

0 

•+r 

0 

0 

-J 

u 

no 

4— 1 

c 

o 

u 

O 

L_ 

0 

u 

0 

u 

CD 

c 

A—> 

0 

ID 

0 

0 

E 

0 

H 

0 

E 

0 

u 

o 

U 

3 

E 

0 

CL 

0 

C 

■-P 

I — 1 

c 

0 

CD 

c 

■o 

X 

u 

c 

> 

_c 

4-J 

3 

L_ 

o 

u 

c 

> 

In 

4- 

in 

£ 

rH 

0 

• 

• 

• 

^U90J9d 


pi  pi  p-i  p-i  ^  ^ 


jsquunN 


Synchronization  Error 


Time  (Sync  Periods) 


Streaming  Performance 


_co 

o  E 

E  w 

CO  0 
c  > 
>»  CO 
~0  _C 


0 

£ 

0 

■a 

o 

c 


if)  0 


0) 

c 

(/)  '<£ 
>  0 

>  0 

o  *- 

i-  o 
CO  0 

0  "0 
CO  _c 
0 

s  * 

i-  ^ 
0  }= 

II 


O  T3 

£  £ 
•  o 


0 

c 


0 
c 
c 
o 
o 

_  CO 
0  T3 


c 

0 


> 

0 


0 


CO 

C  0 

'E  © 
0 
0 


> 

0 


0 
0 

E 
o 

O  0 
Cl  jD 


0  O 

.9  0 

I  s 

0  0 

0  °- 
>"  CL 

Q  0 


0 

a?  .0 

0  ^ 

0  j? 
E  ~ 
-  "S 

c  0 
0  a) 

-i— >  _T“ 

0  i= 
c  0 
o  > 
o  o 


0 

8  0 

0  o 

£  CL  0 
m  X  0 

0  3  0 

■a  o  2 

200 
Q.  2 

0  Q-  0 

s_  0 


216 


Power  Level 
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ridiculously  expensive. 

Relatively  slow  connect  speeds  make  the  MICAs  difficult  to  use  for  real¬ 
time  control. 
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Real-time  Configuration 
of  Networked  Embedded  Systems 
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Real-time  Reconfiguration 
of^Networked  Embedded  Systems 
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Problem  Description  and  Objectives 
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Outline 
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Red  Force  Tagging  Concept  of  Operations 


Relation  to  Other  NEST  Projects 
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Histogram  of  Sync  Jitter  over  6  hour 
period  (50  node  network) 
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Trajectory  Reconstructed  at  RelayMote 
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T rajectory  Reconstruction 
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X  direction 
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Red  Force  Tagging  Physical  Layout 
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Performance  at  Tampa  Demo 
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Separation  Property  of  Rayleigh  Fading  Channels 
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So,  we  can  focus  on  noise  and  interference  separately. 
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M.  Haenggi,  “On  Routing  in  Random  Rayleigh  Fading  Networks”,  submitted  to 
IEEE  Trans,  on  Wireless  Comm.,  2003. 
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NEST  to  RelayMote  (exfiltration)  =  256  bytes/second  (sustained) 


Program  Issues 
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P.J.  Antsaklis,  M.D.  Lemmon,  M.  Haenggi 

University  of  Notre  Dame 


Problem  Overview  -  Red  Force  Tagging 
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Detailed  Scenario  Description 
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Data  processing,  display  and  evaluation:  Data  transmitted  to  the  Relay 
mote  are  processed  and  displayed  to  show  the  trajectory  traced  by  the  subject 
while  out  of  range  of  sensors. 


Technical  Approach  —  Red  Force  Tagging 
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bldg).  subject’s  trajectory 


road  structure 


Detailed  Scenario  Description  —Software 
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Compensation  for  missing  data 
step-length  estimation,  direction  of  travel  estimation 
Reconstruction  of  complete  trajectory 
Display  of  Trajectory  over  area  map. 
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Relation  to  other  NEST  Projects 
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Real-time  Reconfiguration 
of^Networked  Embedded  Systems 
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Agent  name  &  organization:  Juan  Carbonell,  AFRL/Rome 


Subcontractors  and  Collaborators 
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Problem  Description  and  Objectives 
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Red  Force  Tagging  for  SOCOM 
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Goals  and  Success  Criteria 
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Identify  efficient  communication  policies  (power  and 
throughput)  in  random  networks 


Outline 
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“Red  Force  Tagging” 

Track  subjects  to  gather  pre-conflict  intelligence 


Main  passage 
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Red  Force  Tagging  Field  Experiments 
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Relay  Base  Station 


TagMote  Hardware  Overview 


Leica  Module 


How  it  is  done? 

Technical  Approach  -  TagMote 
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TagMote:  Current  Design 
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Leica  DMC-SX 


Technical  Approach  -  NEST  Middleware 
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Reassembly  of  data  packet; 
Compensation  for  missing  data; 
Reconstruction  of  complete  trajectory; 
Display  of  trajectory  over  area  map. 


289 


Trajectory  Reconstructed  at  RelayMote 
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Heading  Estimation 
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Dynamic  calibration:  Tilt  compensation  using  3-D 
accelerometer  data 

Static  calibration:  Magnetic  field  distortion  (e.g.,  due  to 
human  body) 


Heading  Calibration 
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East  [tn] 


waypoints. 


Overall  Performance 
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Performance-Field  Experiment 
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Enhanced  Backbone  Formation  Service 


296 


0) 

o 

o 

CO 


(0 


X 

LU 

"O 

0) 

o 

£ 

(0 


LU 


0 

o 

c 

CO 

E 


0 

0 

i_ 

O) 

_c 

'c 

c 

0 

Cl 

0 

c 


o 

0 

c 

c 

o 

o 


t  -D 
0  0 
CL  £ 


.©  >- 

-  -Q 

0  T3 
0  0 
" - '  0 

?  g 

.E  o 

D) 

g>  Q. 

uE  -“= 

I—  CD 

o 


0 

0 

O) 

0 

CO  ^in 

8  O 

E  £ 

■D  0 

0  (/) 

0  ' 


0 


C 

o 


E 

-i— » 

o  ro 


0 

■D 

0 

o 

o 

■a 

0 

0 

a 

I 


x 

LU 

~o 

0 

O 

c 

0 

c 

LU 


0 

0 

E 

-t— > 
0 
0 

0 

CL 

CD 

0 

o 


c 

o 

■a 

0 

0 

0 

_Q 

CL 

o 


X 

0 

c 

0 

0 

o 


c/) 

I 


0 

o 

0 

o 

£ 

0 

0 

o 

"0 

0 

_Q 

0 

_c 

0 

0 
-I— > 

o 

E 

"b 

0 

0 

■a 


0 

0 

0 


0 

E 

-i— > 

CD 

O 

■a 

o 

-i— > 

0 

§ 


ns 

+-» 

c 

’& 

■c 

3 

0 

U 

□ 

■ 

0 

O 

£ 

0 

(/) 

~o 

0 

o 

c 

0 

c 

LU 


0 

O 

c 

0 

E 

0 

Q_ 


0 

CL 

o 


0 

_Q 

E 

0 

c 


3 

Cl 

_c 

CD 

3 

O 


CD 

C 

0 

O 

0 

s_ 

0 

0 

C 


0 

CL 

CD 

0 

o 


0 

c 

D) 

!c 

i 


0 


tt) 

LU 


0 

■o 

O 

c 

o 

m 

0 

c 

o 

0 
-i— > 

0 

0 
-i— > 

0 

Q. 

3 

E 


1 

] 

\ 

\ 

\  \  V 

oooooooooo 

iflOlfiOlfiOlfiOlfi 

^rtmfONNHH 


297 


one  hop  two  hops  three  hops 
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average  maximum 


New  HW  Design-Adding  Microprocessor 


Additional  TagMote  HW  Considerations 


o 


co 

(/) 

o 

o 


o 

o 

T3 

O 

<2  Q. 
O)  M— 

O  o 


0) 

-Q 

CO 

E 

E 

CO 

O) 

o 


o 

o 

"D 

0 

CD 


o 

c n 

CD 
c n 

co 

o 

a) 


CD 


0) 

o 


c n 


co 

o 

O)  o 

C D  CO 

£  0 
N 

O)  00 
W  ® 

0  o 

0  "O 
O'  2 


300 


Outline 


</) 


CD 

a 


0 

O 

C 

CD 


O) 

c 

’5) 

O) 

as 

a> 

o 


o 

5 

0 


as 

3 

O 

o 

(0 

c 

0 

CO 


c 

a> 

E 

m 

O) 

as 

c 

(0 


"D 

(0 


£  5 

~o  0 
0  > 
0£  O 


o 

c 

"D 

C 

0 

0 

o 

c 

0 

E 

L_ 

't 

0 

Q. 

C 

o 

0 

o 

"o. 

Q. 

0 

c 

0 

0 


0 

_Q 


£ 

0 

Q_ 

■o 

C 

0 

» 

=  </> 


_Q 

0 
-i— > 

0 

C 

o 

a> 

.E  co 
E 

O  CD 

1  t» 


0 


CO 


I  8 

.2  '£ 

•s 

0  ' 

a:  o 


‘o  ro 
O  o 

-•-»  ■= 

o  E 

0 

c 

a)  >> 
-o  -o 

0  "O 

"O  0 

0  -Q 

o  £= 

0  0 

<  b 


o 

(/) 

c 

0) 

CO 

~D 

0 

~D 

~D 

0 

A 

E 

LU 


(/)  (/> 
0 
3 
(/) 
CO 


o 

I 

0 


301 


E 

“  0 

0  ^ 

o)  5> 

CD  2 

_j  o. 


c 

Q) 

E 

m 

O) 

(0 

c 

(0 


"D 

O 

Q) 

> 

o 


05 


■O 

0 

4— 

T3 

O 

E  E 

CD  0 
O  -g 

o 

■a  q_ 

0  S“L 

=  Q_ 

&o 

0  o 

0  CD 

I -I 

>  o 

CO  QQ 

"o  Cl) 


0 

E 

0 

0 

0 


O 

C 

o 

'0 

i_ 

0 

> 


■a 

0 

=  I 

0  0 
=  -C 

O  o 

c  K 
o  c 
O  0 

■o  E 

0  0 
-I— > 

_Q 


D) 
0 
c 
0 

E 

Q  "D 
>»  0 

1  € 


0 


0 

CL 

CO 

■a 

0 

N 


0 

> 

o 

■a 

0 

c 


03  ^ 

0  J- 
3  ° 
0 
0  0 
CD  c 

O  0 

0  ^ 

0  O 
CD 

CD  0 
W  0 

0  O 

*=  § 

CL  ~ 
O  £ 

■0  0 


CJ  —  p 

“  0  £ 
O  0 

O  x 

_l  LU 


0 

o 

CL 

O 


^  ~o 


0 

fl 

Q-  0 
O  ' 
a  0 

®  0 
■p  -a 

E  0 

0  ~ 

o  " 

2  =5 


302 


I 

G 

O 

U 


o 


[4]  M.  D.  Lemmon,  Q.  Ling  and  Y.  Sun  (2003)  “Overload  management  in  sensor-actuator  networks  used  for  spatially  distributed 
control  systems”,  1st  ACM  conference  on  Sensor  Networks  (SenSys03),  UCLA  2003. 
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h  denotes  the  number  of  hops  (10,  20). 
The  path  loss  exponent  a=4. 
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As  d  grows,  the  performance  of  the  random  network 
approaches  the  performance  of  the  regular  network. 


Separation  Property  of  Rayleigh  Fading  Channels 
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Note  that  this  turns  into  a  “disk  model”  for  a^°° 
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Regular  grid:  gmnK 

in  %  of  optimum 
efficiency 

The  difference  between  the  simplest  MAC  scheme  and  the  optimum  MAC 
scheme  is  only  a  factor  2. 

(M.  Haenggi  and  Xiaowen  Liu,  “Fundamental  Throughput  Limits  in  Rayleigh  Fading 
Sensor  Networks” ,  submitted  to  IEEE  Journ.  On  Sel.  Areas  in  Comm.) 
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The  achievable  gain  is  only  2,  so  it  may  not  be  worth  it. 

Make  sure  the  data  rates  are  high  enough  such  that  the  traffic  is 
light.  Then  simple  random  MACs  are  near-optimum. 
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TagMote  to  NEST  download  =  (1  min  walk  to  15 
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•  SOCOM  Challenge  Problem  Enhancements. 

•  Robust  TagMote  and  Middleware  services.  Wireless  Networks-power 
issues 


Technical  Transition/Transfer 
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Program  Issues 
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Technical  Approach  -  NEST  Middleware 


0) 

O 

0) 

o 

■ 

£ 

<D 

(0 

o 

'</) 

ns 

CD 

ns 

0) 

O 


K  =  0.51  -  0.57 

for  wide  range  of  individuals  on 
different  surfaces 


Heading  Estimation 


325 


Dynamic  calibration:  Tilt  compensation  using  3-D 
accelerometer  data 

Static  calibration:  Magnetic  field  distortion  (e.g.,  due  to 
human  body) 


RED  FORCE  TAGGING 

Trajectory  Reconstruction  GU] 


Post-Reconstruction  Analysis  allows  user  to  adjust  trajectory 
to  known  landmarks 
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Field  Experiment 
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Experiment  1 

(walk  from  UVA  to  ND’s  sitel 


Quit  I j  Draw 


Experiment  2 

(subject’s  walk  north  of  ND’s  site) 


Experiment  3 

(walk  from  UVA’s  site  to  ND’s  sitel 


Experiment  4 
(walk  north  of  ND’s  site) 


Caveats  and  Future  Plans-MacDill  AFB 


>* 

■Q 

■D 

O 

■4-1 


0) 

E 


(/) 

(0 

0) 

> 

(0 

O 


c 

o 

Q. 

D) 

C 

t 

<0 

+-> 

(0 

c 

£ 

o 

c 

* 


o 

N 

"</> 

(0 

0) 

o 


O) 

(0 


CO 


CO 


0  i 
CO  5 


CO 

0 

o 

c 

0 


E 

o 

S  £ 

-S2  5 


w  pi 

^  E 

o 
t 
0 

0. 


0 

c 

O) 

0 

E 

i 


(0 

c 

o 

.5 

L 

0 

> 


o 

0 


o 

CO 

C  -Q 
0  3 
0  0 

I  I 


C 

o 

0 


X 

0 

"D 

c 

0 

"D 

0 

o 

c 

£ 

o 

Q 


> 


-O 

0 

Q. 

0 

O 


0 

"D 

3 


0 


CO 

o 

0 

!n 

3 

0 


334 


Currently  limited  to  a  person  *  Enhanced  by  adding  gyros  and  GPS 

walking/running 


June  2004,  Charleston  SC 


Real-time  Reconfiguration 
of^Networked  Embedded  Systems 
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Agent  name  &  organization:  William  Koenig,  AFRL/Rome 


Subcontractors  and  Collaborators 
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OSU-Extreme  Scaling.  Attended  April  meeting  in  Columbus. 
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Problem  Description  and  Objectives 
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Red  Force  Tagging  for  SOCOM 
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Goals  and  Success  Criteria 
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Red  Force  Tagging 
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Relay  Base  Station 


Benefits 


Technical  Approach  -  TagMote 
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TagMote  Hardware  Overview 


Leica  Module 


TagMote:  Design  II 
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Leica  DMC-SX 


Technical  Approach  -  NEST  Middleware 
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Reassembly  of  data  packet; 
Compensation  for  missing  data; 
Reconstruction  of  complete  trajectory; 
Display  of  trajectory  over  area  map. 
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Trajectory  Reconstructed  at  RelayMote 
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Heading  Estimation 
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Dynamic  calibration:  Tilt  compensation  using  3-D 
accelerometer  data 

Static  calibration:  Magnetic  field  distortion  (e.g.,  due  to 
human  body) 


Heading  Calibration 
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East  [tn] 


waypoints. 


Overall  Performance 
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Performance-Field  Experiment 
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Enhanced  Backbone  Formation  Service 
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(Earlier)  Performance 

-  Low  throughput  caused  by  limited  connectivity  in  spanning  tree 

-  Deadlocks  due  to  missed  messages 
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Enhanced  Download  Service 
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average  maximum 
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Tag  Mote  HW 
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Slave  controls  Sensor  and  Flash  Memory  card. 
Master  in  charge  of  Networking  tasks 


New  HW  Design-Adding  Microprocessor 


TagMote  New  HW 
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Master  when  needed. 


Differential  Storage  and  Transmission 
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Bytes  4. ..23  contain:  the  differential  measurements. 


0 

Q. 

X 

yj 

"D 

C 

0 

0 

Ho 

0 


368 


Additional  TagMote  HW  Considerations 
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Outline 
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Distances  in  Random  Networks 
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particular,  (R.,)m  is  exponentially  distributed. 


Energy  Consumption  in  Random  Networks 
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Energy  Consumption  in  Random  Networks 
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So  the  expected  maximum  energy  consumption  grows  with 


Comparison  of  Regular  and  Random  Networks 
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Random  and  Quasi-Regular  Networks 
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Random  distribution  Quasi-regular  distribution  with 

o=0.2  on  integer  grid. 
“Gaussian  network” 
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Quasi-Regular  Networks  II 
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The  problem  is  that  some  nodes  may  need  to 
be  active  in  more  than  one  period. 


Quasi-Regular  Networks 
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Quasi-Regular  Networks 


379 


We  can  show  that: 

-  more  than  50%  of  the  nodes  are  used  exactly  once 

-  more  than  25%  of  the  nodes  are  not  used. 


Conclusions  for  Network  Architecture 
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SOCOM  Challenge  Problem  Enhancements. 

Robust  TagMote  and  Middleware  services.  Wireless  Networks-power  issues 


Technical  Transition/Transfer 


Program  Issues 
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Real-time  Reconfiguration 
of^Networked  Embedded  Systems 
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Agent  name  &  organization:  William  Koenig,  AFRL/Rome 


Subcontractors  and  Collaborators 
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Problem  Description  and  Objectives 
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Red  Force  Tagging  for  SOCOM 
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Goals  and  Success  Criteria 
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Red  Force  Tagging 
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Relay  Base  Station 


Benefits 


TagMote:  Design  1.2  (single  processor) 
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Leica  DMC-SX 


TagMote  Hardware  (single  processor) 


Leica  Module 
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New  HW  Design-Adding  Microprocessor 


TagMote  New  HW 
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Master  when  needed. 


Tag  Mote  HW:  Design  2.0 
(two  processors) 
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Data  Compression 
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Bytes  4. ..23  contain:  the  differential  measurements. 
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Reassembly  of  data  packet; 
Compensation  for  missing  data; 
Reconstruction  of  complete  trajectory; 
Display  of  trajectory  over  area  map. 


TagMote  Software 
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forwards  the  payload  data  to  the  MODEB  component.  It  also  receives  the  frames  to  be  sent  to  the 
Leica  and  transmits  them  byte  by  byte  to  the  sensor  through  the  UART  component. 

]•  UART:  It’s  the  component  that  manages  the  UART  hardware,  it  initializes  the  hardware  and 
receives  and  transmits  the  bytes  from  and  to  the  UART. 
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The  route  service  provides  means  of  detecting  the  state  of  the  network. 

The  synchronization  service  is  used  by  the  network  and  relay  motes  to  create  a 
global  clock. 

The  telemetry  service  is  useful  for  debugging  purposes.. 
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where  a  is  the  spatial  efficiency. 


L^i=l  1+rf/B 

In  this  case,  a  is  the  “parallel  connection1'  of  1  +  rf/S. 

The  larger  the  distances  r*  and  a,  and  the  smaller  0,  the  better  is  the  spatial 
efficiency. 


Note  that  0  <  popt  <  1/2  and  0  <  gmax  <  1/4.  The  upper  bound  is 
achieved  for  a  — *  oof  the  lower  bound  for  a  —  0. 
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all  the  statistics  of  7  and  the  interference.  This  is  important,  since  the  mean 
interference  for  SIR  is  not  sufficient  to  characterize  the  network.  In  fact,  both 
are  often  infinite  (not  just  for  a  —  2  or  fading  networks)  due  to  the  heavy 
tails  of  their  distributions. 


Outline 
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•  SOCOM  Challenge  Problem  Enhancements. 

•  Robust  TagMote  and  Middleware  services.  Theoretical  studies  of  large-scale 
wireless  networks. 
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